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MAGELLANIC CLOUDS. I. TRANSPARENCY 
By HARLOW SHAPLEY 
HARVARD COLLEGE OBSERVATORY 
Communicated January 10, 1951 


Several special studies of the Clouds of Magellan have been undertaken 
in the past year as a part of a monographic treatment of these nearest of 
external galaxies. One such investigation is the examination of the extent 
to which they are transparent to the light of more distant stellar systems. 
They are sufficiently large in angular extent that a deficiency in the popula- 
tion of faint background galaxies is a satisfactory indication, and a rough 
measure, of the internal space absorption. The amount of gas and dust 
now in the Magellanic Clouds is obviously of importance to speculations 
on their origin and development. A few of the larger spirals also should 
similarly reveal their total absorption, at least in their uncrowded outer 
regions, when searched for background galaxies to the twenty-first magni- 
tude and fainter. 

For the Bruce refractor, with which the Harvard survey of southern 
galaxies has been made, the scale is too small to differentiate clearly in many 
instances the distant translucent galaxies in the fields of the Clouds from 
their own numerous nebulosities, star clusters, and nebulous groups. But 

-the scale of the 60-inch reflector, 1 mm. = 2676, suffices to clarify many of 
the doubtful objects. 

The whole of both Magellanic Clouds and some of the surrounding areas 
have been covered with reflector plates, mostly of thirty minutes’ exposure 
on fast emulsions. Only the central part with undistorted images— a fifth of 
a square degree—has been searched on each plate for external galaxies. Be- 
cause of the large zenith distances of the Clouds, even when on the meridian 
(z = 44°, and z = 40° for Small and Large Cloud, respectively), many of 
the plates are of low quality. For the Small Cloud, therefore, only 60 
of the better plates, covering a total of 12 square degrees, have been used 
in this examination; for the Large Cloud, 122 plates are used, covering 
24.4 square degrees. Reflector plates of comparable exposure in regions 
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well outside the Clouds provide something of a check on the surety of 
identifying faint galaxies and on their frequency, but the total area out- 
side the Clouds covered with the reflector plates is too small to justify a 
direct quantitative comparison with the populations in the Clouds. For 
the quantitative comparison of the Clouds with the neighboring fields, 
the large-area Bruce plates are used. They also serve for the estimates of 
total magnitudes of the galaxies. 

The comparison fields around the Clouds are selected from a published 
catalog of galaxies from declination —60° to the South Pole.'| They are 
chosen in such a manner (table 1) that the mean galactic latitudes and lon- 


TABLE 1 
COMPARISON FIELDS 


NUMBER 
PLATE GALACTIC———-> oF 
FIELD NUMBER LONG, ° ar. * GALAXIES 


A14232 273 —5O 
17198 268 —40 
17222 —50 
17150 —45 
14395 
14213 —48 
16299 —44 
17194 —40 

Means —45 


Small cloud 


~ 


CNS & b 


Large cloud 1 17155 

17163 —38 

17187 —39 

16686 —34 27.2 

16732 5 —37 60.9 

16641 —30 15.4 

16368 —28 15.2 

16758 —26 18.6 
9 14256 —25 18.1 

Means —33 30.3 


= 


gitudes fall within the Clouds they surround. Each plate covers nine square 
degrees. From the last column of the table, which records the number of 
galaxies per square degree brighter than magnitude 17.5, it is found that 
the surroundings of the Small Cloud are considerably below normal popula- 
tion (V = 21), and around the Large Cloud, somewhat richer than the 
average high latitude field. Also, for both the Clouds, the surrounding 
fields are of uneven population; in consequence this test of the transpar- 
ency does not have high quantitative value. 

The excess of galaxies in the neighborhood of the Large Cloud has long 
been known. The Cloud lies, in fact, near the end of an extensive meta- 
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galactic cloud of faint galaxies, 
that extends from the Large 
Cloud some forty degrees or 
more in the direction of the 
South Galactic Pole.? 

The poorness of the environ- 
ment of the Small Cloud, not- 
withstanding its high galactic 
latitude of —45°, is somewhat 
surprising. The question nat- 
urally arises as to whether in 
this direction there is a cloud 
of interstellar material, per- 
haps near the observer, that 
dims the light of the galaxies 
by half a magnitude. That 
possibility, aowever, is ruled 
out by the star counts which 
show, in the van Rhijn tables, 
for instance, a stellar density 
around the Small Cioud that 
is not at all below average for 
the galactic latitude. A block- 
ing dust cloud outside our 
Galaxy, in intergalactic space, 
which would dim galaxies but 
not stars, would need to be of 
unacceptably large dimensions 
and mass. It seems best to 
interpret the scarcity of galax- 
ies in the general region of the 
Small Cloud to a detail of 
metagalactic structure. There 
are similar sparse regions else- 
where in both the northern 
and southern skies. Long ex- 
posure photographs around 
the Clouds with the Baker- 
Schmidt telescope in South 
Africa, which will take the 
nebular counts to the nine- 
teenth magnitude or fainter, 
will eventually clarify the 
situation. 
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In table 2 the results from the reflector plates are assembled. The Small 
Cloud is divided into an inner and outer region; the Large Cloud is ex- 
amined separately for four regions—the axis; the clustered areas, which 
are rich in giant clusters, smaller nebulous groups, and bright and dark 
nebulosity; the open regions (stellar deserts) inside the main body of the 
Cloud; and the open regions that lie around the border of the main body 
but within the area where outlying Cepheids and clusters are found. The 
cosmograph in figure | indicates the location of these four regions.* 


71 


FIGURE 1 


The cosmograph of the Large Magellanic Cloud 
shows in the main body of the system, the Axis and 
the coarsely cross-hatched clustered regions, as well 
as the more thinly populated deserts where also the 
translucent galaxies are dimmed by approximately a 
magnitude. 


The numbers of plates are given in the second column of table 2—reflec- 
tor plates for the Clouds, Bruce plates for the comparison fields. The areas 
of the various regions and the number of galaxies per square degree to suc- 
cessive limits of magnitude are given in the following columns. For the 
comparison fields the mean errors refer not to square degrees but to the 
individual plates, each of nine square degrees area. 

Conclusions.—An examination of the tabulated numbers and of figure 2 
shows that the Small Magellanic Cloud is essentially transparent. For the 
translucent galaxies brighter than magnitude 17.25 both the inner and outer 
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regions show a population closely comparable with that in the eight sur- 
rounding large areas, but the numbers of objects involved are small. For 
the magnitude interval 17 to 18, where there would naturally be some dif- 
ficulty in finding translucent galaxies in the core of the Cloud because of 
the relatively high star density, the inner population falls below that of the 
outer region of the Cloud, which in turn seems to be a little more populous 
than the mean of the comparison fields. An interstellar absorption in the 
inner section of the Cloud of two- or three-tenths of a magnitude is not out 
of the question. There are several small nebulous wisps in the central 
part of the Cloud and probably some scattering dust associated with 
these bits of nebulosity. 

The observations in the Large Cloud region, in table 2 and figure 3, 
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Distribution of galaxies in and around the — Distribution of galaxies in and around the 
field of the Small Magellanic Cloud. field of the Large Magellanic Cloud. 


give a conspicuously different result. The crowded axis is nearly devoid of 
translucent galaxies. Galaxies are also scarce in clustered areas, and, sur- 
prisingly, in the open areas in the middle of the Cloud. On the borders of 
the main body of the Cloud, the frequency is about the same as on the 
four sparsely populated comparison field plates, which show a population 
much like the average for high latitudes the whole sky over. 

From these data we conclude that space absorption apparently can be 
ignored on the borders of the Large Cloud, but amounts to as much as a 
magnitude at some places inside. The presence of much visible nebulosity 
and also of very many supergiant stars is consistent with this result on 
opacity, and suggests that the birth of low-density high luminosity stars 
may be a current operation in the Large Cloud, whereas in the essentially 
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dust-free Small Cloud the age of star birth is past. This hypothesis rests, 
of course, on the assumption that under appropriate conditions the inter- 
stellar medium can aggregate into new stars of vast extent. 

The two Clouds also differ in certain properties of their Cepheid variables. 
In a forthcoming paper this difference will be related to the transparency 
peculiarities. 

' Harv. Ann., 105, No. 8, 140-141 (1937). 

2 These PROCEEDINGS, 24, 282-287 (1938); Harv. Rep. 150 (1938); Ibid., 24, 527~ 
531 (1988); Harv. Rep. 154 (1938). 

§ Harv. Obs. Bull. 881, 1 (1931). 


MAGELLANIC CLOUDS. II. SUPERGIANT RED VARIABLE 
STARS IN THE SMALL CLOUD 


By HARLOW SHAPLEY AND VIRGINIA MCKIBBEN NAIL 
HARVARD COLLEGE OBSERVATORY 


Communicated January 17, 1951 


How bright and how big can stars be? This is a question for which the 
nearer of the external galaxies can provide a precise answer. Both the 


relative and absolute values of the total outflow of energy of super-lumi- 
nous stars of various types, in any discrete stellar system, can be found 
simply from measures of their apparent magnitudes, once the distance 
modulus has been determined from Cepheid variables. (The interstellar 
dimming in our own Galaxy and between galaxies does not enter the prob- 
lem.) From the luminosities and colors we can also compute the corre- 
sponding gigantic sizes of these brightest of stars in external galaxies. 

Of the nearer galaxies, the Large and Small Clouds of Magellan are of 
course the most useful for an inquiry into the frequency and behavior of 
supergiants. These galaxies are rich in various types of stars and struc- 
turally are wide open to photometric and spectroscopic surveys, even with 
telescopes of intermediate size. The high luminosity stars in our own gal- 
axy, on the other hand, cannot be comprehensively surveyed. With us 
the uncertainties in distances, the uneven dimming by space absorption, 
and the incomplete census in remote regions prevent the making of a good 
survey. 

1. Supergiant Luminosities.—In the present communication we report 
on high-luminosity reddish variable stars of the Small Cloud, listing all the 
known long-period variables and irregular or semiregular stars brighter 
photographically than absolute magnitude —2.3 at maximum.* A later 
study of these types of variables will be based on spectroscopic studies. 


3 
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2.—The brightest star in the universe so far as now recorded is the well- 
known bluish variable star of the Large Magellanic Cloud, S Doradus. 
Its absolute magnitudet appears to be about —9 (half a million times the 
Sun’s luminosity), and its brightness is therefore exceeded only by the 
bursts of super-novae. Doubtless there are similar superlatively bright 
stars in the open spirals and in the more massive of the irregular galaxies. 
In the Small Cloud no variable is so bright. Among the Cloud’s constant 
stars there are, in our records to date, seven of spectral class B with absolute 
magnitudes ranging from —6.8 to —5.7, two Class O stars with absolute 
magnitudes of about —5, and one P-Cygni star with absolute magnitude 
—6. (For our galactic system the absolute magnitudes of the irregular 
TABLE 1 
PERIOD IN RADIUS 
DESIGNATION TYPE DAYS IN A.U 
HV 1956 Cepheid 209 
821 Cepheid 127 
824 Cepheid 65. 
834 Cepheid 73. 
2195 Cepheid 41. 
829 Cepheid 
11423 Irregular 
1475 Irregular 
11157 Cepheid 
863 Cepheid 
837 Cepheid 
847 Cepheid 
865 Cepheid 
840 Cepheid © 
817 Cepheid 
1541 Cepheid 
1652 Irregular 
1636 Cepheid 
2084 Irregular 
2064 Cepheid 


wow 
oo 


red variables have been discussed by Joy,' and by Wilson,’ and of the long- 
period variables by Wilson and Merrill.* The highest luminosities at 
maximum do not exceed photographic magnitude —3.0 and very few ex- 
ceed —2.0.) 

3.—In the recognized membership of the Small Cloud the twenty 
brightest variables are those of table |. HV 1956, at the top of the list, 
may of course be a foreground star, but it conforms precisely with the 
period-luminosity relation. 

4. Supergiant Sizes.-Not only the greatest radiators but also stars of 
largest known dimensions are revealed by the survey of reddish super- 
giants in external galaxies. The long-period variables, the supergiant 
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Cepheids, and the reddish irregular and semiregular variables of the Small 
Cloud all have low radiative efficiency, and therefore those of highest abso- 
lute luminosity have surface areas far in excess of the largest stars of similar 
types yet measured in our galactic system. To illustrate the sizes, let us 
take c.i. = +1.5 magnitudes as a reasonable average value for the color 
index of the irregular red variables of the Small Cloud, which resemble 
Betelgeuse in variability and in color, and presumably in surface brightness. 
Then the visual absolute magnitudes of the five brightest in table 5 (omit- 
ting HV 11546, which may be superposed) average V, = —4.0 — ci. = 
—5.5 at maximum. They are therefore AM = 2.0 magnitudes brighter 
absolutely than the mean of Betelgeuse and Antares,‘ and their radii must 
be approximately 10°74” = 2.5 times as great. Such stars would fill the 
orbit of Jupiter. In the Large Cloud we shall find larger and brighter red 
stars. Probably such supergiants, though yet undiscovered, exist also in 
our own galactic system. 

Comparing directly with Betelgeuse, of absolute magnitude —3.9 (vis.) 
and a radius r, of two astronomical units, and assuming a color index of 
+1.5, we compute the radii r, = r,10°? *” of the twenty stars in table 1 
and obtain the values, in astronomical units, given in the last column. The 
mean is 4.9; the radius of Jupiter’s orbit, 5.2 astronomical units, is exceeded 
by the radii of seven of the variables. 

5. The Comparison Fields.—The large angular dimensions of the Magel- 
lanic Clouds make necessary a special investigation before actual member- 
ship rather than random superposition can be assigned to the variable 
stars. For some types of stars, however, the high galactic latitudes of the 
Clouds assist in the discrimination between Cloud members and superposed 
stars of the galactic system. The high concentration toward the galactic 
plane of classical Cepheids, novae, open star clusters, bright nebulosities, 
and stars of spectral classes B, O, and P-Cygni, gives assurance that when 
such objects appear on photographs of the Clouds they are actual Cloud 
members. On the other hand, for the eclipsing binaries of most types, long- 
period and irregular variable stars, all of which have a wide distribution in 
galactic latitude, we need to make a closer study. Two methods of dis- 
criminating are available; the radial velocities, and the intercomparison 
of populations in and out of the Clouds. The radial velocities of the Clouds 
are high and are rarely equalled by galactic variables; but the apparent 
faintness of the stars in the Clouds has up to now excluded the use of the 
spectrographic test on the variable stars. We use the other method. 

Fourteen comparison fields, each of about 75 square degrees, have been 
thoroughly examined for variable stars. Table 2 lists the fields and for 
each the number of variable stars of the long-period and irregular types, 
in four intervals of magnitude. For the work on the Small Cloud (8 = 
—45°) we have used only the six fields of similar latitudes at the top of the 
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list, two of which, between the Clouds, somewhat overlap. About 400 
square degrees are covered by the six fields, and 1000 square degrees are 
covered by all fourteen. The Small Cloud variables fall within an area of 
25 square degrees. Hence the subtotals and the totals of table 2 are di- 
vided by 400/25 = 16 and 1000/25 = 40, respectively, to give the popula- 
tion of foreground variables, that would be expected on the basis of the 
comparison field surveys in the area covered by the Small Cloud. 

In itself, this census of high latitude fields has some points worth noting. 
For example, long-period variables do not appear fainter than the four- 
teenth magnitude, with the exception of a single variable in the field of 
lowest latitude (No.-356). In fact, except in that field, there are only six 
long-period variables fainter than magnitude 13 in more than 900 square 
degrees. In the seven polar cap fields (latitudes 60° or higher, area 525 


TABLE 3 
PROBABLE CLOUD MEMBERS 


DISTANCE 
HARVARD MAXIMUM MAGNITUDE ABSOLUTE PERIOD FROM 


LONG-PERIOD VARIABLES: 


VARIABLE MAGNITUDK RANGE MAGNITUDE IN DAYS CENTER COLOR 
859 14.3 ]2.7 —-3.0 573.4 Red 
1719 14.5 ]2.4 —2.8 531.9 0.5 Red? 
2112 14.2 —3.1 607.2 1 Very red 


Red? 


740.7 


12149 


1375 5. ~1, 510 Red 
1645 16.3 —1.0 300.3 0 Yellow? 
1865 15.7 ]2.3 —1.6 562.4 0.6 Red 
11295 15.0 1.8 —2.3 571.4 0.4 Yellow 
11303 15.7 }1.3 —1.6 540.0 1.0 Red 
11329 15.2 1.9 —2.1 388.0 0.3 Very red 


254.4 Red 


11427 


square degrees) there are only four long-period variables fainter than the 
twelfth magnitude at maximum. In other words, the long-period vari- 
ables, of which hundreds are known, are rarely found more than a thousand 
parsecs from the galactic plane. Irregular variables are found more numer- 
ously among the fainter stars. 

From table 2 we conclude that any long-period variable fainter than 12.0 
at maximum, appearing on Cloud photographs, is almost certainly a mem- 
ber of the Cloud. Brighter than the twelfth magnitude we might expect 
one superposed long-period variable and one irregular variable. Possibly 
in the magnitude interval 12-14 one or two superposed irregular variables 
might be expected but, fainter than 14, probably nothing from the fore- 
ground, 

In summary, we find from the study of fourteen comparison fields in 
similar galactic latitudes that practically every reddish variable star of 
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these two important types fainter than 11.0, discovered on the photographs 
of the Clouds, are actual Cloud members and therefore giants and super- 
giants. 

6. Long-Period Variables.—-In 1923 the three brightest variable stars 
in the southern globular cluster 47 Tucanae were shown to be long-period 
variables of extraordinarily high luminosity at maximum. Their crowding 
toward the center of the cluster argued against the possibility of their being 
superposed galactic stars. The long-period variable stars later detected 
in the adjacent Small Cloud could not be assigned so definitely to the super- 
giant category until the completion of the comparison field work described 
in the preceding section. The assignment is now possible. Eleven of the 
bright long-period variables of the Small Cloud region are actually members 
of the Cloud. The brightest four, 1f/ = —2.9, are about four magnitudes 
brighter absolutely than the average long-period variable in the galactic 
system, but little if any more luminous than the brightest of the galactic 
long-period variables.* Table 3 contains information on all the long-period 
variables now known in the Small Cloud, with the absolute magnitudes 


TABLE 4 


FOREGROUND LONG-PERIOD VARIABLES 


MAXIMUM PERIOD 

NAME MAGNITUDE RANGE IN DAYS SPECTRUM 
HV 860 10.0 6.0 215.5 M3e 
TZ Tuc. 9.8 13.7 228.8 M0e 
U Tue. 8.5 6.7 262.3 M7e 
HV 864 11.0 4.5 319 MSe 


based on the assumption of Cloud membership. The greatest distance 
from the geometrical center of the Cloud is 1°2. Therefore the eleven 
variables are crowded into less than 5 square degrees. 

7. It should be noted that eight of the periods are unusually long, and 
for at least two of the variables the ranges are small. Although uncommon 
in range and period similar variables are found in the galactic system.® 

In the vicinity of the Small Cloud are four long-period variables of typical 
range, period, and spectrum; all are brighter than the eleventh magnitude 
at maximum (table 4). Presumably they are members of the galactic 
system, at a distance of only a thousand parsees or so, and therefore too 
near and too bright for us to estimate accurately either distance or absolute 


magnitude! 
8. Irregular and Semiregular Variables.—In the first part of table 5 are 


listed the irregular and semiregular variable stars of the Small Cloud for 
which the apparent magnitudes at maximum are 14.9 or brighter. In the 
lower section are listed the other ten fainter variables of these kinds that 
have been sufficiently examined to determine their characteristics. As for 
the long-period variables, the distance from the center and the color are 
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given along with data on magnitudes. Four of the stars, marked with as- 
terisks, had not heretofore been published. Their positions, in the usual 
coordinate system,’ are as follows: 


DESIGNATION 
11235” 
11330 
12002 
12192 


TABLE 5 
IRREGULAR AND SEMIREGULAR VARIABLES 


DISTANCE 
HARVARD MAXIMUM ABSOLUTE 
VARIABLE MAGNITUDE RANGE MAGNITUDE NOTES 


825 4 0.7 —2.9 (1) : Very red 
832 2 0.6 —3 (1) : Very red 
838 —3. 5754 Red 
1455 0.7 (1) Very red 
1475 0.7 —4, (2) : Very red 
1652 ‘ 3. Very red 
2084 3. 3. (1) : Very red 
2232 : (1) Very red 
11262 —2.6 Slow Very red 
11280 ¢ Red 
11378 Slow ; Yellow 
11401 ; Yellow 
11402 3. d Very red 
11465 Slow ; Very red 
11470 f 6 —2 Slow j Yellow 
11546 y 2.6 Red 
6 —2.§ Slow Blue 
3.3 Slow 6 Red 
ke Slow j Red 
1432 Very red 
1586 His Red 
1644 (3) Red? 
1722 Very red 
1963 Red 
2105 Slow Very red 
11417 1000 + (3) a Very red 
11455 Very red 
12122 Red 
w* 


750 (3) 


te 


‘ 


* Found by Miss Henrietta H. Swope on Columbia-Yale plates. Type and color by 
Miss Swope; magnitudes and type verified by Mrs. Nail with Bruce plates. 

(1) Hoffleit, D., Harv. Obs. Buil., 900 (1935). 

(2) Period of 61945, Harv. Circ., 280 (1925) could not be verified. 

(3) Heights of maxima differ from cycle to cycle. 


8426" 
8860 
8212 
8310 

| 
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As indicated in the fifth column of table 5 under “Notes,” a number of 
these irregular variables exhibit slow variation, and for four of them an ap- 
proximate length of the cyclic variation is found. The eleventh magnitude 
variable HV 11546 exceeds all others by more than a magnitude at maxi- 
mum. It may be a foreground star. It was omitted in the listing of 
brightest luminosities in section 3. 

9. Summary.—Luminosities as high as VM, = —5.0 and diameters 
in excess of eight astronomical units are found in the survey of the super- 
giant red variable stars in the Small Magellanic Cloud. All the brightest 
irregular and long-period variables (21 and 4, respectively, brighter than 
My, = 15.0) are discussed in this paper; the long period Cepheids have 
been treated in various earlier reports. Through the study of the variable 
star population in comparable latitudes an estimate of the number of 
superposed foreground variables of all types is possible. Essentially all 
red variables are certainly Cloud members, as are the classical Cepheids. 
The census of the fourteen comparison star fields shows that typical long- 
period variables will rarely if ever be found in latitudes higher than + 40° 
if fainter than magnitude 14.0. 


* The modulus is taken as 17.3 in the following discussion, and the magnitude scale 
is that set up with the Bruce refractor. The forthcoming revision of scale and zero- 
point through Mr. Ivan King’s photoelectric measures will somewhat modify these 
values but not affect appreciably the arguments based on them. 

+ All magnitudes referred to in these papers on the Magellanic Clouds are photographic 
unless otherwise indicated. 

' Joy, A. H., Mt. Wilson Contr., 668 (1942); Ap. J., 96, 344-370 (1942). 

? Wilson, R. E., Mt. Wilson Contr., 669 (1942); Ap. J., 96, 371-381 (1942). 

3 Wilson, R. E., and P. W. Merrill, Mt. Wilson Contr., 658 (1942); Ap. J., 95, 248- 
267 (1942). 

‘Van de Kamp, P., Pop. Ast., 58, 404 (1940). 

5 Our work on this highly luminous group of long-period variables was first reported 
at the Ottawa meeting of the American Astronomical Society, in June, 1949; Abstract, 
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PERIODIC SELECTION IN ESCHERICHIA COLI*+ 


By K. C. Atrwoop, LILLIAN K. SCHNEIDER AND FRANcIsS J. RYAN 


DEPARTMENT OF ZOOLOGY, COLUMBIA UNIVERSITY 
Communicated by Th. Dobzhansky, January 2, 1951 


Spontaneous mutations in bacteria are generally reversible. Conse- 
quently, in a medium where prototrophy confers no selective advantage, 
various auxotrophs (and other mutant types) should accumulate until for 
each type the number is such that the loss by mutation to prototrophy 
should equal the gain by mutation to the corresponding auxotrophic con- 
dition. That is to say, mutational equilibria should be established. 

Nevertheless it is common experience that prototrophic bacterial popu- 
lations continuously maintained on complex media remain essentially proto- 
trophic and contain at any time a very small proportion of auxotrophs. 
Conversely, under the same conditions, populations started from an 
auxotroph remain essentially auxotrophic and contain at any time a very 
small proportion of prototrophs. The forward and reverse mutation rates 
for any pair of alleles can lead to only one equilibrium, yet everyday ex- 
perience shows us that two stable equilibria exist for each known mutant 
in bacteria. We may conclude therefore that for each mutant at least one 
of the equilibrium ratios cannot be attributed to a balance of mutation 
rates. 

The above considerations serve to emphasize the ubiquitous role of selec- 
tion in controlling the stability of bacterial populations which would other- 
wise become extremely heterogeneous as a result of mutation pressure. 
Perhaps it is not too surprising then, that we have found as a result of 
experiments to be described below that there exists in Escherichia coli a 
mechanism for stabilization of the major component of the population by 
the simultaneous suppression of all mutants. 

Materials and Methods.—-The four initial stocks of /. coli (strain 15) 
used in these experiments were histidine requiring, lactose non-fermenting 
(h— lac—); histidine independent, lactose non-fermenting (A+ lac—); 
histidine requiring, lactose fermenting (4— lac+); and histidine independ- 
ent, lactose fermenting (4+ lac+). Stocks were kept on nutrient agar 
slants transferred every three months, grown for 36 hours at 37°C., then 
kept refrigerated between transfers. The main experimental procedure 
was the serial transfer of populations comprised of various initial ratios of 
the above stocks through Gray and Tatum synthetic medium containing 
25y of L-histidine monohydrochloride per cc. (/7+). The growth rate and 
final level of growth of h— and A+ are indistinguishable on //+.' The 
serial transfer experiments (STs) consisted in transferring 0.5 ce. of cultures 
in stationary phase to 50 ec, of the above medium in a 125-cc. Erlenmeyer 
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flask. This was incubated for 12 hours at 37°C., then used as the inoculum 
for the next flask. The final level of growth in each flask was 2.5 « 10" 
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Increase of h+ to equilibrium. A, composite of eight ST experiments. 
B, data from a single ST. 
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FIGURE 2 


Decrease of 4+ in four experiments started with various ratios 
above equilibrium. Arrow A indicates point of isolation of com- 
ponents used in reconstructions shown in figure 4 (A), and arrow 
B of those used in figure 4(B). Broken line indicates equilibrium 
level. 


bacteria. Thus each flask was started with an inoculum of 2.5 X 10° bacteria 
which grew for 6.64 generations, then remained in the stationary phase some 
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five hours until the next transfer. From time to time samples were taken 
from fully grown cultures to determine their composition. These samples 
were washed once by centrifugation, then plated in appropriate dilutions 
in //+ agar and in agar without histidine (J7— agar) to determine the 
proportion of h+ present. In experiments involving lactose fermentation 
markers, the h+ colonies from /]— agar were subsequently tested for lac- 
tose fermentation by streaking on the surface of Endo agar. The initial 
inocula for STs were prepared from the stock slants by inoculating test 
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FIGURE 3 

Simultaneous behavior of marked and unmarked 4+ in the same STs. 
A, various starting ratios of h+ which differ in lactose fermentation 
character from the h—. Points on the base line indicate time at which 
no h+ were detected. 8B, various starting ratios of h+ having the same 
lactose fermentation character as the h—. Solid circles in A are samples 
from the same cultures as solid circles in B. Likewise for open circles, 
triangles. Broken line indicates equilibrium level. 


tubes containing 10 ce. of 47+. These were grown to full turbidity at 
37°C. and mixed in the desired proportions. Where it was desired to be- 
gin the ST with no /+ present, the inoculum was diluted to contain about 
10*h—. When STs were to be started from h— and h+ isolated from pre- 
vious STs, ten colonies from /7+ and /7— agar, respectively, were isolated 
to nutrient agar slants and an inoculum from each was prepared as above. 
The ten h— inocula were mixed in equal proportions, the ten 4+ similarly 
mixed, and finally the desired starting ratio of h+ to h— was constructed 
from the two mixtures. 
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Results—In STs started from inocula containing oaly h—, the 4+ in- 
creased until after about 15 to 20 transfers (100 to 140 generations) an 
equilibrium ratio of about | 4+ per 10° h— was obtained as shown in 
figure 1(A). When STs were started with various initial h+,/h— ratios 
ranging from less than | 4+ to about 100 4+ per 10°h—, the equilibrium 
was reached eventually, as shown in figure 2. A curious feature of h+/h— 
ratios above equilibrium is the plateau region of some 30 transfers which 
always intervenes between the start of the experiments and the fall to 
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FIGURE 4 

A, solid circles: h+ and h— isolated at time indicated by arrow A in 
figure 2, and reconstructed at high ratioh+/h—. Triangles: Reconstruction 
of h— isolated from arrow A and original h+. Open circles: Reconstruction 
of h+ isolated from arrow A and original h—. 

B, open circles: h+ and h— isolated at time indicated by arrow B in 
figure 2, and reconstructed at high h+/h— ratio. Solid triangles: Repetition 
of same experiment. Solid circles: Reconstruction of h— isolated from arrow 
B and original h+. Open triangles: Reconstruction of 4+ isolated from 
arrow B and original h— 


equilibrium.* In addition to the data shown in figure 2, forty-eight similar 
experiments yielded the same results. 

A similar series of STs was set up with various starting ratios of h+ 
lac—/h—lac+ and also of h+lac+/h—lac—. These experiments show 
the effects of selection alone, since in each case the A+ arising from h— 
by mutation can be distinguished by its lactose fermentation character 
from the h+ added at the beginning of the experiment. The lactose fer- 
mentation markers were used reciprocally in different experiments to rule 
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out the possibility Chat they confer, in themselves, any selective advantage 
or disadvantage. In all cases the equilibrium ratio is reached by the un- 
marked h+ which arise by mutation from h— (Fig. 3(B)), while the marked 
(added) h+ remain essentially constant for about 30 transfers, then rapidly 
disappear as shown in figure 3(A). In addition to these data thirty-six 
similar experiments gave the same results. Thus, both marked and un- 
marked h+ have the same duration of temporary stability. 

The results of experiments with high h+/h— ratios and with marked 
h+ suggested that the h— had, at the time of the rapid decrease of h+, a 
selective advantage over the i+ present at this time. Since this inter- 
pretation seemed, superficially at least, to be contradicted by the concurrent 
maintenance of equilibrium of the unmarked h-+-, it was tested directly by 
reconstructing high h+/h— ratios from the h— and h+4- components iso- 
lated from plates at a time shortly after the beginning of the rapid decline 
of h+/h— ratio (arrow A) in experiments like that shown in figure 2. 
These reconstructed ratios showed in every case (eleven experiments in 
all) an immediate fall of 4+ ratio to equilibrium as in figure 4(A). These 
experiments demonstrate that a selective advantage of h— over h+ ex- 
isted at the time the components were isolated. However, further recon- 
struction experiments where the components were isolated from the same 
ST as in the previous experiment, but shortly after equilibrium was reached 
(arrow B, Fig. 2), showed an entirely different result as in figure 4(B). 
At this time no selective advantage of h— was manifested until after 30 
transfers, more or less, during which time the proportion of 4+ remained 
essentially constant. Finally, there was another rapid decline of h+ to 
equilibrium, just as in the previous experiments shown in figure 2, Thus 
we see that during the fall of h+/h— ratio to equilibrium the h— appears 
to have a selective advantage over 4+, but when equilibrium is reached 
both the h— and h+ components appear to have, temporarily, no difference 
in selective advantage. 

Now, the question arises whether the 4+ at equilibrium in these experi- 
ments has gained in selective advantage to match that of h—, or the h— 
has somehow lost its ability to overwhelm h+. To decide this, 4+ was 
isolated from an ST in which an initially high h+ /h— ratio had dropped to 
equilibrium, and was placed in competition with the h— which had been 
originally used to start the ST. In experiments of this type (eight in all) 
the h+ always rapidly replaced h— until the whole culture became proto- 
trophic. Thus we see that the selective advantage which appears first in 
the h— component in these experiments is very soon afterward possessed 
by both components (Fig. 4(B)). 


These reconstruction experiments have been done for three STs 
in sequence. To make this clear we can designate the initial com- 
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ponents of the first ST as 4o— and ho+. These were started at a high h+ /- 
h— ratio and after equilibrium was reached h— and + were isolated from 
the equilibrium population. Since they both can be distinguished from 
ho— and hy+ by their greater selective advantage they can now be desig- 
nated 4,— and h,+. This process was repeated starting with h;— and 
h,+ and we obtained h,— andh.+. Similarly, from these we have 4;— and 
hs+. The results of placing the third of these derived components in 
competition with the others in various combinations are shown in figure 5. 
Each successive cycle has resulted 
in a new strain which is able to 
overwhelm the previous _ strain. 

In the above experiments, the 
bacteria with selective advantage al- 
ways appear among the /—_ first. 
This is necessarily the case, if they 
arise by random mutation, since there 
were anywhere from 10‘ to 10° times 
as many h— as h+ present at all 
times. When experiments were done 
with 1:1 ratios of h+/h—, the mix- 
ture became unstable after a prelim- 
inary period of stability, and A+ 
became predominant in some experi- 
ments. 

Discussion.—We may regard the 
behavior of marked A+ and_ of seugnations 
h+ /h— ratios somewhat above equilib- 
rium as indicative of a _ process A, open circles: hy+ and ho—. 
which is taking place in the cultures Solid circles: M+ and h—. 
at all times and which would pass ht 

, open circles: hy— and ho+. 

unnoticed in the absence of such Solid circles: end 
indicator conditions. This process Triangles: and 
consists of the appearance of mutants 
which have a selective advantage over the bacteria from which they 
were derived. These mutants begin to replace the bacteria which were 
originally present. At some time, when the number of such mutants 
in the population is sufficiently high, they begin to give rise to other 
mutants which have, in turn, a greater selective advantage. The 
population is again replaced by this second mutant type, and soon, Under 
the conditions of our experiments these mutations usually occur in the 
h— component of the population, because h— are so much more numerous. 
We interpret the subsequent appearance of the mutants in the smaller h+ 
component as a consequence of the mutation of h— to h+. Thus the 
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mutation pattern of the above experiments may be written: 
ho . + hy an hy - h; 
| 


As each successive type of h— reaches its peak, the previous h— and 4+ 
are eliminated and the new 4+ bacteria must arise by mutation from the 
newh—. According to this interpretation, the h+/h— equilibrium is ac- 
tually the resultant of a series of curves as shown in figure 6. Since we 
have no knowledge of the rates of mutation to advantageous types, or of 
the relation between the proportion of selectively advantageous mutants 
and the strength of selection pressure, we cannot tell what sort of periodic 
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fluctuations in the equilibrium ratio are to be expected. Sampling error 
and the grouping together of data from many experiments, as in figure 1(A) 
conspire to mask any periodicity. In individual experiments where the 
population was sampled at frequent intervals, periodicity is visible as in 
figure 1(B). In any case, if this interpretation is correct, the equilibrium 
of h+/h— should be maintained at that level where the number of 4+ 
present is equal to the number which will be added (by mutation of 4— to 
h+ and subsequent growth of /+-) in the time necessary for the initial 4+ 
to disappear. Thus we see that the system wouid operate to produce the 
equivalent of selection against h+. Similar population shifts resulting 
from the appearance of more advantageous types have been —_— by 
Stocker®, Schneider’, and Novick and Szilard’, 

Lieb* has found the mutation rate, a, from h— to h+ to be 2.86 & 10° 
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and the mutation rate, 6, from h+ to h— to be 1.2 K 107% At mutatioral 
equilibrium: 

The observed h+/h— ratio at equilibrium averages 1.3 X 10~*. There- 

fore a strong selection against h+ must be postulated.* The average time 
necessary to eliminate the 4+ can be estimated from experiments like those 
in figures 2 and 3 and appears to vary between 200 and 300 generations. 
Since an equilibrium is maintained, it follows that the number of 4+ pres- 
ent at equilibrium must be added by mutation and growth within this time. 
Ryan and Schneider‘ have shown that in order to account for the rate of 
increase of h+ in small h— populations where 4+ mutants first appear, it is 
necessary to assume that each 4+ produces no additional 4+ for about 
two generations after it arises by mutation. They have attributed this 
to segregation lag, which will be fully described in their paper. If we as- 
sume a segregation lag of two generations and compute the 4+ added per 
transfer from the following equation (valid for g > 3): 


h+ = aNo[2°-! + 20-2 + (g — 2)20-] (2) 


Where a = 2.86 X 107° is the mutation rate from h-- toh+; Ny = 
2.5 X 10* is the A— inoculum of each transfer; and g = 6.64 is the number 
of generations per transfer, we find that 950 4+ are added per transfer, of 
which 9.5 on the average are included in the next inoculum. At the equilib- 
rium of 1.3 h+/10° h—, 325 h+ are carried on the average in each inocu- 
lum. Now by the time 325 additional h+ are added, the original 325 must 
have disappeared for equilibrium to be maintained. Thus the replacement 
of hi\+ by h2+, for instance, would occur with this assumption within 
325/9.5 = 34 transfers (or about 225 generations). This figure compares 
very favorably with the time necessary to eliminate marked h+ in experi- 
ments such as those in figure 4, and thus lends support to the assumption of 
segregation lag. It should be reemphasized that no prediction about the 
shape of the equilibrium curve follows from this line of reasoning. It 
serves only to show that the replacement phenomenon supplies selection 
of the right order of magnitude to account for the equilibrium. 

We suppose that the periodic replacement phenomenon which we ob- 
serve in these cultures would take place in the absence of the conditions 
for its recognition. Thus the selection against 4+ in the presence of h— 
may be regarded as having no relation to any intrinsic properties conferred 
by these genotypes. The genes in the population which are favored by 
periodic replacement are simply those having numerical superiority, since 
the selectively advantageous mutations will virtually always occur in as- 
sociation with these genes. When the predominant type is replaced periodi- 
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cally, all other mutant types which it has produced are eliminated and 
must arise anew by mutation (Fig. 7). Thus periodic replacement selects 
simultaneously against all types arising by mutation, whether or not they 
are selected against in any other way. 

It is important to note that this does not affect the possibilities for adap- 
tation to new environments, since if the environment selects against the 
predominant type, periodic replacement will be arrested until the popula- 
tion is transformed by the adaptive growth of the appropriate mutant. 
For example, if h— containing a small proportion of h+ is placed in H— 
medium the adaptive growth of h+ results.’ 

Some appreciation of the adaptive value of periodic selection can be 
gained from a consideration of the stability of a prototrophic population 
on a complex medium. There are reasons for believing that the mutation 
rates to auxotrophy are in general higher than the rates to prototrophy. 
As has been pointed out by Lieb* there are more ways of genetically block- 
ing a reaction sequence than of restoring a single genetic block. In the 
case of h— and h+ Lieb found the mutation rate to auxotrophy to be more 
than fifty times that to prototrophy. Assume that a prototrophic popula- 
tion which is mutating to (conservatively) 25 different auxotrophic condi- 
tions with a fiftyfold mutation pressure toward each is placed in a medium 
which does not select against these auxotrophs. The equilibrium value 
for each independent auxotrophic condition would be 49 auxotrophs to one 
prototroph. With only 25 such equilibria independently established, the 
frequency of prototrophs in this population would be about 10~**. This is 
a total loss of ability to readapt to a simple medium. Thus, in the absence 
of some other equally general mechanism, periodic selection becomes a 
necessary condition for bacteria which remain prototrophic while growing 
on complex media. 

In estimating the general importance of this form of selection as a stabi- 
lizer of bacterial populations, the prime consideration is whether selectively 
advantageous types can continue to occur indefinitely, or whether the 
potentiality for such mutations will be exhausted within a certain time. 
At this time we can present no data which give a final answer to this ques- 
tion. In one serial transfer experiment which lasted for 1000 generations 
the equilibrium was maintained throughout. If the periodic selection 
mechanism failed to operate, the 4+ would increase either to mutational 
equilibrium or possibly to an equilibrium governed by some other mecha- 
nism of selection. Since this did not occur the conclusion is justified that 
the population was replaced about 4.5 times in this experiment. As we 
have seen, there are more general considerations which seem to demand the 
existence of some mechanism which acts to prevent the increase of many 
mutants simultaneously. While it is difficult to imagine an indefinite in- 
crease in the ability of a bacterial population to select against all ancestral 
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populations, it is perhaps more credible to suppose that such selective ad. 
vantage might operate only on a limited number of the ancestral popula- 
tions and that eventually we might arrive at a point where some ancestral 
type would actually have a selective advantage over the type at hand. If 
all of these types are accessible by mutation we may propose a mutational 
pattern such as shown in figure 7, in which each member can select against 
only a few of the previous types. This would insure the perpetual operation 
of the periodic selection mechanism. This scheme does not imply that the 
mutations are directed, since mutations to types which are temporarily 
disadvantageous may constantly occur without observable effect. The 
value of such speculation, and also the question of whether selection in- 
volves interactions between populations or is a consequence of intrinsic 
growth rate differences remains to be determined by further research. 
Summary.—Experiments involving the serial transfer of cultures of 
E. coli for long periods of time show that, for reasons not fully under- 
stood at present, the population is periodically replaced by a new type 
arising in the predominant component and characterized by a_ selec- 
tive advantage over the previous type. Thus any clone within the popula- 
tion has a certain maximum lifetime which limits the number of mutants 
which can be produced and established during its growth. Consequently, 
the predominant genotype does not yield to mutation pressure. This 


phenomenon may be regarded as a general mechanism that postpones the 
establishment of mutational equilibria. 
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MORPHOLOGICAL DIFFERENCES BETWEEN TWO SIBLING 
SPECIES, DROSOPHILA PSEUDOOBSCURA AND DROSOPHILA 
PERSIMILIS 


By M. T. M. Rizx1 
CoLUMBIA UNIVERSITY 


Communicated by Th. Dobzhansky, December 20, 1950 


The work on the population genetics of the sibling species, Drosophila 
pseudoobscura Frolova and Drosophila persimilis Dobzhansky and Epling 
has long been handicapped by the impossibility of distinguishing single 
individuals of these species by any morphological trait, and the consequent 
necessity of making cytological examinations or genetical tests. Lance- 
field' first differentiated these forms as ‘‘races or physiological species’ on 
the basis of their genetic behavior, and called them “‘race A’’ and “‘race B”’ 
of D. obscura Fallen. Frolova and Astaurov? showed that Lancefield’s 
“race A”’ from western United States differs from the European D. obscura 
in chromosome complement and in genital structure, and proposed for the 
former the name D. pseudoobscura Frolova. Because of the effectively com- 
plete reproductive isolation between “race A’”’ and “race B,’’ Dobzhansky 
and Epling* recognized the latter as a separate species, D. persimilis. 
Mather and Dobzhansky,‘ Reed, Williams and Chadwick® and Reed and 
Reed® made a statistical comparison of certain morphological characters in 
D. pseudoobscura and D. persimilis. Some average differences were found, 
but the overlapping of the variation curves proved so great that these dif- 
ferences were of no practical significance in recognition of the species. 

At the suggestion of Professors E. Mayr and Th. Dobzhansky, the writer 
undertook a detailed comparison of the male genitalia of the two species. 
Ten strains of D. pseudoobscura and ten of D. persimilis, derived from flies 
collected in various parts of the geographic distributions of each species, 
were used. All stocks were maintained at 16°C. The male flies were 
macerated in a 10% solution of KOH, washed in water, the genitalia re- 
moved, stained with carbol fuchsin and mounted in glycerin. Ten meas- 
urements from each strain were made with the aid of an ocular micrometer, 
one unit of which is equal to 0.0042 mm. 

The structure of the genital arch and the hypandrium in the two species 
has been found to be alike. A clear-cut difference has, however, been 
noted in the dimensions and the proportions of the penis (Figs. 1 and 2). 
A description of the morphology and homologies of various parts of 
Prosophila male genitalia has been given by Salles.’ 

The penis of D. pseudoobscura is relatively long and cylindrical, whereas 
that of D. persimilis is shorter and broader at its base with a tapering ap- 
pearance. The length of the penis is measured from the base of its lamina, 
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excluding the articular condyle, to the distal end of the lamina. Care was 
taken not to rupture the membrane at the tip of the penis. The width was 
measured at the greatest curvature of the penis’ lamina with respect to 
the mid antero-posterior axis of the penis (Fig. 3). The penis index is 
computed as the ratio: length of the penis/ width of lamina. The results 
of the measurements are summarized in table 1. 

There is no significant difference among the geographical strains of D. 
pseudoobscura, nor among the strains of D. persimilis. However, the dif- 
ference in the index of the two species is highly significant. The mean index 
for D. pseudoobscura is 8.61 + 0.31; the mean index for D. persimilis is 
6.48 + 0.51. Considering the extremes in the two species, we may con- 
clude that any individual with a penis index equal to or above 7.64 is 


FIGURES LAND 2. Camera lucida drawings of the penis and accessory parts in Drosophila 

pseudoobscura and D. persimilis, respectively: F, forcep; P, penis; S, stylus. Ficure 

3. The measurements taken on the lamina of the penis: L, length; PL, penis lamina; 
W, width. The scale represents 100 micra. 


pseudoobscura; a penis index equal to or below 7.60 indicates persimilis. 
One pseudoobscura male out of 100 measurements had an index 7.64. This 
extreme was obtained from different absolute units than the 7.60 index of 
persimilis. Considering this situation, we should employ both criteria 
for classification, On the basis of the shape of the penis and also the index 
we are definitely able to identify males of the two species. 

The question that now arises is whether such morphological differences 
as those described are of any value in understanding sexual isolation be- 
tween the species. Mayr* believes that some functional difficulties are en- 
countered in heterospecific matings such that “incomplete copulation’ is 
the result. Our findings show definite differences in the size and shape of 
the penis of the two species; however, we can not assign the functional 
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difficulties to this structure alone. In addition, as indicated by the points 
of articulations, the styli have a functional relation with the penis. It 
seems hopeful that further studies on the mechanism of the genitalia of 
D. persimilis and D. pseudoobscura may reveal some coordinated move- 
ments of various parts constituting a specific sequence which may differ in 
the two species. 


TABLE 1 


MEASUREMENTS AND PENIS INDICES OF DIFFERENT STRAINS OF D. Persimilis AND D. 
Pseudoobscura 


MEAN MEAN 
STRAIN LENGTH WIDTH 


Pseudoobscura 

Atitlan, Guatemala 5.00 
Chichicastenango, Guatemala ; 5.00 
Amecameca, Mexico 
Cuernavaca, Mexico 

Black Mesa, Arizona 

Coffee Creek, California 

Alturas, California 

Zion, Utah 

Aspen, Colorado 

Mara, British Columbia 


8. 20-8. 80 
8. 80-9 .00 
8. 40-9 .00 


tt tt t 


Persimilis 

Aspen Valley, California 37.9 

Coffee Creek, California 39.2 

Deer Creek, California 38.5 

Nahogui, California 38.6 

Orick, California 36.9 

Porcupine Flat, California 37.3 

St. Helena, California 37.7 

Sequoia Park, California 38.5 

Hope, British Columbia 37.5 

Quesnell, British Columbia 38.4 : 6.40 
Mean penis index for D, pseudoobscura 8.61 * 0.08 
Mean penis index for D. persimilis 6.48 + 0.05 


tht tt t 


617-6. 50 


Nore: Mean length and mean width are in ocular units. The mean penis index of 
each strain is given with its standard error. 


Summary.—Because of a complete reproductive isolation, Drosophila 
‘pseudoobscura and D, persimilis are considered separate species. However, 
no morphological differences between them were known. It is shown that 
the males of these sibling species can easily be distinguished by the shape 
of their genitalia. 

Acknowledgments. —The writer is deeply indebted to Professor Th. Dob- 
zhansky for his interest and guidance throughout the course of this study. 


| | | 
| 
| 
| j 
{ 
j 
MEAN LIMITS 
8.58 0.057 
8.64 0.057 
8.82 0.120 8.60-9.78 | 
8.51 0.066 8. 20-8.89 
8.18 0.095 7.64-8. 60 
8.59 0.089 8.18-9.00 
8.50 0.054 8.20-8. 80 
8.50 0.082 8.20-9.11 
& 92 0.066 8.60-9. 20 
0.029 6.17-6. 50 
0.123 5.57-6.18 
0.146 5. 43-6. 50 
0.114 6.33-7.40 
0.136 5. 83-7. 20 
0.041 6.73-7.20 
0.123 6.55-7 .60 
(0.086 633-709 
0.165 600-7 .60 
O17 
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A CONDITION FORA REAL LATTICE TO DEFINE A ZETA 
FUNCTION 


By N. C. ANKENy* AND C. A. ROGERS 


DEPARTMENT OF MATHEMATICS, PRINCETON UNIVERSITY, AND LONDON COLLEGR, 
LONDON UNIVERSITY 


Communicated by S. Bochner, January 31, 1951 


In Theorem | we shall prove the following: If the product of m linear 
forms in n variables, whose coefficients are real and whose determinant is 
positive, assumes only a finite number of different values in a finite inter- 
val, and assumes the value 0 only if all the variables are 0, then one of the 
linear forms arises from a ring in an algebraic number field, and the other 
n — | linear forms are the n — | different conjugates of the first linear form. 
Hence, the product is essentially the norm of all numbers in an order of an 
algebraic number field, which clearly takes on only a finite number of 
values in any finite interval. 

This problem arose in connection with a forthcoming paper of S. Bochner, ' 
“Some Properties of Modular Relations.” We show that if any real lat- 
tice gives rise to a zeta function, the lattice comes from an order in an alge- 
braic number field.» One can expand this to prove that the zeta function 
will satisfy a certain functional equation.* To define a zeta function from 
any real lattice it is clear that one must have certain properties of discrete- 
ness on the product of the forms to insure convergence. However, in the 
proof of Theorem 1, discreteness of the product is needed for only a finite 
interval. We thank Professor Bochner for letting us see in advance a copy 
of his paper. 

Definition: Let x, be real linear forms in wy, i.e., 


= + + + = 1,2, ..., (1) 


where w, is real and the determinant A = |w,”| is positive. 
Denote by 


| 
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P(X) = xx = TT + uw, ) (2) 
Let A denote the lattice consisting of all points in (1) where ™, ..., Up 
take on all integer values. 

THEOREM |. Suppose that there is no point X of A other than the origin 0 
with P(X) = O and suppose that P(X) assumes only a finite number of dif- 
ferent values a, with ~A S aS A. Then the product P(X) can be expressed 
in the form 

P(X) = w TT (ww + ... + uw, 
i=] 
where ..., are algebraic integers in a field of degree n and 
(i = 2,3, ..., are their (n — 1) different algebraic conjugates. 

1. Proof: Lett, ...,t,-1 bem — 1 real numbers which are linearly in- 

dependent over the rational numbers. Define ¢, by 


= 0. 


Write A = 6". Then by Minkowski’s fundamental theorem on inequalities 
for linear equations, there is a point Y of A satisfying 


for all positive numbers A. For such a point we have 
P(X) & A. 
By hypothesis, P(X) can take only a finite number of values for such a 
point X. Thus, there is a nuniber a # 0 with the property that for suit- 
able arbitrarily large values of \ there is a point Y of A satisfying 
P(X) (3) 
and 


|x| SO", (4) 


Let I’ be the set of points X of A satisfying P(Y) = a Let Ui, U2, ..., 
LU, be the different products which can be obtained by multiplying between 
{ and » of the variables m4, ..., 4,. Then we can write 


P(X) = QUit + AUVs. 
We know that for every point Y of [ 


and that Ll, ..., 0) are integers as the u’s are integers. For each point 
X of 1 we regard the result 


oes = (5) 
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as an equation for &, ..., Qa. 

II. Our first object is to prove that this system of equations (5) deter- 
mines %, ..., 2, uniquely. We suppose that the equations have a second 
solution Q; + 01, ..., 2, + O,. Then we have 


OU; +...7¢ 


for all points X of f. As A # 0, the m, ..., 4, are linear forms in x, 
...,¥, So we can express 


OUi + ... + 


as a polynomial in ..., x. Let Vi, Y2, ... be the monomial expres- 
sions of x, ..., x arranged in some order. Then 0,U; + ... + O,Us 
will be a linear form in ¥;, Y2, ..... We may choose polynomials p;(x), 
po(x), ... so that 


+ = p(x ... x™) Vp, 


where p(x) vanishes identically whenever J}, is divisible by x'x .. 
x™. We may suppose that 1j, Y2, ... are ordered so that 


p,(x) does not vanish identically for p < 7, 
p,(x) vanishes identically for p > r. 


‘Then we have 


OUit ... = p,(P(X))Y, 


identically and for all X of T 


p,(P(X))¥, = 0. (6) 


We take a sequence of values of \ tending to + © and consider a corre- 
sponding sequence of points X of T. We have 


P(X) = II x = 


i=l (7) 


= ) Ba, (x™ )Bno, 


| 
| 
|x| <= i= — 
| So 
1 In| > 
If 
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where {;,, ..., Ba, are integers, we see that as the exact order of Y, is 

If two of the monomials Yi, ..., Y,, say Y, and Y,, had the same order of 
magnitude as \ we would have 

so that 

h(Bi, Bia) tn(Bnp Bue} = 0 
and by our choice of t, ..., t, we would have 

iy Bio Be, — Bre Bap — 


But this is impossible unless o = p since neither Y, nor Y, is divisible by 
x... Thus no twoof ..., have the same order of magnitude 
as \-* » and X runs through a sequence of points of T’ satisfying (7). 
Since these points all satisfy (6) we see that 


p,(a) = 0. 


Hence, the polynomials p,(X), ..., p,(X) are divisible by the polynomial 
(Y — a). Comparing the degrees in the identity 


+ ... +0,U, = b,(P(X)) Y,, 


we see that r = 1, ¥; = | and 
OU; + ... + OU, = a(P(X). a) 
for some constant a. But the expression on the left has no constant term 
and ais #0. Hence, a = 0, and 
=O =... =®=0. 
Thus the equations (5) determine &%, ..., Q, uniquely. 


Ill. As the numbers ..., U, are rational integers, we have by Sec- 
tion II that the 2's are rational multiples of a. Therefore, 


P(X) = (uw + (8) 
i=l 


when expressed as a polynomial in %, ..., 4, has all coefficients as rational 
multiples a. Divide (8) by (wa ... wi™) giving 
wP(X) = Il (uy Uy (9) 
when expressed as a polynomial in 4, ..., %, has all its coefficients rational 


mutiples of 
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a 


However, the coefficient of 4," is 1. Hence all the coefficients are rational 
numbers. 

Letting uw, = 1, u3 = uy = ... = ty = 0 we see »» for (¢ = 1, 2, ..., 2) 
are all algebraic numbers. Similarly, we have »,;‘9 for i, 7 = 1,2, ...,m 
are all algebraic numbers. 

Let K = where R is the rational numbers (i.e., 
K is the least algebraic extension of R containing ..., Asm + 
mu, O unless uy = = ... = = 0, we see that 
(K:R)=n'2n,. Hence, consider the n’ different isomorphisms of K leaving 
R fixed. As they leave R fixed they leave the coefficients of the u’s in the 
expansion of (9) fixed. By Hilbert’s theorem on the uniqueness of the 
irreducible factors of a polynomial in m variables, we see that the n’ iso- 
morphisms of K must merely permute the linear factors on the right side 
of (9). As all isomorphisms of K act differently on u; + man"? +... + 
Unvn', we see that n’ = n, and the linear factors on the right-hand side of 
(9) are the n — 1 different conjugates. This proves Theorem 1. 

IV. It is of some interest to note that to prove Theorem 1 we do not 
need the full hypothesis of discreteness. Actually we only need to note 
the fact that there exists one a # 0 which satisfies (3) and (4) for suffi- 
ciently large values of A. This modification has application to the critical 
lattice of m dimensional paraboloids. However, in this present note, we 
cannot take up this problem in any detail. 

* Under the auspices of the Office of Naval Research. 

' Bochner, S., Ann. Math., soon to be published. 

2 Siegel, C. L., Gottingen Nachr., 31, 25-31 (1922). 

3 Siegel, C. L., Math. Annalen, 85, 123-128 (1911). (See also footnotes 1 and 2 
above.) 


THE JACOBSON RADICAL OF A SEMIRING 
By SAMUEL BOURNE 
INSTITUTE OF ADVANCED Stupy, Princeton, New Jersey* 
Communicated by H. S. Vandiver, December 18, 1950 
1. Introduction._-A semiring is a system consisting of a set S together 
with two binary operations, called addition and multiplication, which forms 
a semigroup relative to addition, a semigroup relative to multiplication, 


and the right and left distributive laws hold. This system was first in- 
troduced by Vandiver.' He also gave examples? of semirings which cannot 
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be imbedded in a ring. Semirings arise naturally when we consider the set 
of endomorphisms of a commutative additive semigroup.’ 

Our purpose is to generalize the concept of the Jacobson radical of a ring* 
to arbitrary semirings. In section 2 we define the concept of an ideal in a 
semiring S and develop the corresponding homomorphism theorem for 
semirings. In section 3 we extend the definition of the Jacobson radical 
to arbitrary semirings, and in section 4 we obtain some properties of the 
Jacobson radical of a semiring. We conclude with a consideration, in sec- 
tion 5, of the Jacobson radical of matrix semiring S,. 

This paper has profited greatly from discussion with C. A. Rogers, a 
colleague of mine at the Institute. ‘ 

2. The Homomorphism Theorem...We shall assume that the additive 
semigroup of S is commutative and that S possesses a zero element. The 
latter assumption is not vital in the sense that if S lacked a zero element, we 
can easily adjoin one to S. 

Definition 1: An ideal of S is a subset J of S containing zero such that 
if 4, and % are in J, then 7, + % is in J, and if 7 is in J, and s is any element 
of S, then 7s and si are in J. 

We shall say that s; is equivalent to s. modulo the ideal J, if there exist 
elements 7; and 7 of the ideal J such that s; + 4; = so + %. This definition 
is a translation to the additive notation of one given by Dubreil® for a mul- 
tiplicative semigroup. This relationship is obviously an equivalence. 
The set of elements § equivalent to each other modulo the ideal / is called 
a coset of the ideal 7. Relative to the usual definitions of an addition and 
multiplication the cosets of an ideal J of a semiring form a semiring, called 
the difference semiring S — J. The correspondence s — § defines a homo- 
morphism of S into the difference semiring S = S — J. Conversely, if 
the semiring S is homomorphic to the semiring 5S’, then the elements of S 
mapped into 0’ of S’ form an ideal J, and S’ is isomorphic to § = S — J, 
via the one-to-one correspondence s’ <—— > §. We have proved 

Tueorem |. the semiring Sis homomor phic to the semiring S’, then is 
isomorphic to the difference semiring S — I, where I is the ideal of elements 
mapped into 0’. Conversely, if I is an ideal of S, then Sis homomor phic to the 
difference semiring S — I. 

3. The Jacobson Radical of a Semiring.--Definition 2: The element r 
of the semiring S is said to be right semiregular if there exist element r’ and 
r"in Ssuch that r +r’ + rr! =r" + rr’. 

We notice that in the case S is a ring, this definition reduces to the one 
usually given for right quasi-regularity in a ring. 

TuroreM 2. A necessary and sufficient condition that the element r of S 
be right semiregular is that for any element s in S there exist elements s' and 
s" in S such that s + s’ + rs’ = s" +975". 

Proof: Vi r is right semiregular, then r + r’ + rr’ = r" + rr”. There- 


Vou. 37, 1951 MATHEMATICS: S. BOURNE 165 


fore,s On letting s’ = r"sands" = 
s+ r’s, we obtain s + s’ + rs’ = s” + rs". 

Conversely, we suppose that s + s’ + rs’ = s” + rs", forany sin S. In 
particular, r + s’ + rs’ = s” + rs” and ris right semiregular. 

Definition 3: The right ideal / is said to be right semiregular, if for every 
pair of elements 7), 72 in / there exist elements j,; and j, in 7 such that 4, + 
hi + = te + je + tije + tots. 

The elements j; and j, are not unique, for this condition implies that 
ijt jet = + je t+ tije + tej: + Jj, where j is any element 
+ in(j:1 + j). In particular, on substituting % = 0, we obtain that 4, is 
right semiregular. 

Lemna |. Jf I and I* are right semiregular ideals, then I + 1* ts a right 
semiregular ideal. 

Proof: Since J is right semiregular, for every pair of elements %, 72 in / 
there exist elements j; and j2 in J such that 4 + ji + tiji + taj = t2 + 
de + je + ji. If 4,* and are in then 4,* jy and to* 
+ + je are in /*. 

Since /* is right semiregular, there exist elements j;* and j.* in /* such 
that + + + + + + + + + 
jo® = (12% + + + + + je) + + 

Therefore, (¢; + + (ji t+ + + jojo*) + (et (ji + + 
+ jojo*) + (ta + *) (jo + 
tojo) + [(ti* + + + + + + + + 
+ *j2) jo*) + (ti t+ tj + + (a + + + bj) = 
(42 + jo + toje + + + + + + + + 
jo* + + + + (bo + he + tije + + (i + 
dit + jo* = (ta + + + + + + + 01%) 

(j2 + jo* + + + (ta + + + + 

Since j; + + + and jx + j.* + jije* + joji* are in] + 
this equation implies that J + /* is right semiregular. 

THEOREM 3. The sum R of all the right semiregular ideals of a semiring S 
ts a right semiregular two-sided ideal. 

Proof: Lemma | implies that R is right semiregular. If 7; and rz are 
in R, then r;s and rs are in R, for any sin S. Hence there exist elements 
and r,in R such that ris + 73 + risrs + resrs = ras + + + 
Whence, srisr; + srari + srisrary sresrary = Sresry + srary srysrary + 
Sresrafo. Adding the last two equations and adding sr; + s7z to both sides 
of the resulting equation we obtain that sr, + (sry + sry + sree) + 
sri(sry + + srare) + sro(srary + sre + srare) = sry + (sre + + 
+ sro(srs + srati + Stare) + sri(ste + + srari). Since sty + 
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+ and sre + + are in SR, this equation implies that sR 
is right semiregular and sR < R. Hence R is also a left ideal and R is a 
two-sided ideal. 

Definition 4: The right Jacobson radical R of a semiring S is the sum 
of all the right semiregular ideals of S. 

In a corresponding manner, we obtain the left Jacobson radical R’ as 
the sum of all the left semiregular ideals of S. An ideal of S is said to be 
semiregular if it is both right semiregular and left semiregular. 

Lemma 2. I[f «4 and i, are contained in the right ideal I of S, and if 4, + 
ti ji + i2j2 + je + + tej, and i, + ki + kit) + = ty + ko + 
ke, + Rito, where jy, jo, ki and ke are in I, then there exists an element | in I 
such thatk; + 

Proof: ki + (4, +i + + 12 j2) + +i + + je) + + 
je + hi je + + (4, + ky + + het) + (a, +k, + + Rot) jr + 
(t2 + ke + + Rite) je. 

Hence ky + + jo + tije + tafi) + Rilte + jo + + + + 
dit tifa + taja) = (to + he + Rats + Rite) + + he + hot + hie) ji + 
+ Ri + Rit, + Reta) jo. 

Therefore, k, + je + (1) + ki + + Retz) je + (42 + ko + Rite + ket) ji 
+ ty + + = he + + (to + he + Roti + Rite) + + + Rit + 
ja + t2 + kot + itr. 

We define = (i, + ke + kat; + Ritz) ju + (4 + ki + + je + 
ty + koti + Rite. . 

Then ky + jo + 1 = ke + ji + 1, where 1 is in J. 

Lemma 3. The right Jacobson radical R of a semiring S 1s a left semiregu- 
lar ideal. 

Proof: Theorem 3 states that R is a left ideal. We let 7; and r2 be any 
two elements of R. Since R is right semiregular, there exist elements s; and 
so in R such that + 5; + 715) + = re + Se + + Therefore, 
St t+ ty + Siti + Sate = Sp + te + Site + Soli, where and are in R. 

By Lemma 2 there exists an element v in R such that t; + r2 + v = fp + 
ro +v. Therefore, + 7+ Ste 
Sila + OF Sy + Siti Sole +h + + Site + Sah +h + v. 
Whence, + + + Sole + + +e = + 
Shors t+ Hh + svth +0 = 
+ Sh + +h + Sire. 

Finally, + +: Siti + Sore + = Sy + + Site + Sori + Where u is 
in R. Since s; and s are in R, this equation implies that R is a left semi- 
regular ideal. 

THEOREM 4. The right Jacobson radical of a semiring S is equal to its left 
Jacobson radical R’. 

Proof: Lemma 3 implies that R < R’. From the symmetry of our 
results, we have that R’ < R. Hence R = R’. 
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We may now refer to the Jacobson radical R of a semiring S. 

4. Properties of the Jacobson Radical.—Definition 5: The semiring S 
is said to be semisimple if its Jacobson radical R = (0). 

Definition 6: The semiring S is said to be a radical semiring if its Jacob- 
son radical R = S. 

TueoreM 5. If R is the Jacobson radical of the semiring S, then the dif- 
ference semiring S — Ris semisimple. 

Proof: Welet S = S — Rand R be the Jacobson radical of §. If 7, and 
ry. are in R, there exist elements §, and § in R such that 7, + 3 + 75, + 
= + Se + + This implies that 7, + 5; + + + 73 = 
ro + so + rys2 + res; + 74, where r3 and 7, are in R, and rj, re, s; and s2 are in 
the set R* of all elements of S belonging to the cosets of R. R* is an ideal 
in S, since Ris an ideal in §. R* contains R, since 0 is in R. 

Since r; and r, are in R, there exist elements 53 and s, in R such that 7, + 
Sa + 1454 + 7353 = 13 + S3 + 1354 + 

If we denote the left-hand side of this equation by (e) and its right-hand 
side by (e)’, then (e) + s3 + (e)sa + (e)’ss = (e)’ + 53 + (€)’sa + 
yields that ry + 5) + + 7252 + Sa + S184 25254 + 
1354 + + S253 + 18253 + + = + 5283 + + 
Sq + S484 + S253) + + S254 + Sa + 5153) + 73 + Sa + + 1453 = + 
+ S254 + S183 + 11(S2 + + S254 + S83) + + Sa + + S253) + 
+ 53 + 1454 + 1353. 

We let = 51 + 5154 + 5253, to = So + + lg = 73 + + 
ty = + 135s. 

Since 74 + 54 + 1454 + 7353 = 13 + S3 + 7354 + 1453, We have that t; + s; = 
ts + ss. On substituting we obtain that + + ri(ti + 54) + ro(te + 53) + 
te t+ = + (te + 53) + + +h t+ Ss. 

On adding to both sides (7; + r2)ts we obtain that 7, + (t; + 54 + 4) + 
ri(ti + ty) + + S83 + by) = (to + + + + + + 
ro(ti + + 

Since t; + 54 + 4 and ft, + ss + 4 are in R*, this equation states that R* 
is a semiregular ideal in S. Therefore R* < Rand R = (0). 

THEOREM 6. The Jacobson radical of a semiring S is a radical semiring. 

Proof: We let R(.S) denote the Jacobson radical of S and R(R(S)) the 
Jacobson radical of R(S). Since R(S) is semiregular in R(.S), then R(S) < 
R(R(S)). Trivially, R(R(S)) < R(S). Whence, R(R(S)) = R(S). 

Thus the structure of an arbitrary semiring is reduced to the considera- 
tion of semisimple semirings and radical semirings. 


1 
If r” = 0, then on letting r’ = r* andr” = r* ~! we obtain 
i i 


thatr + 7’ + roar” +rr”. Hence if an element is nilpotent in S, it is 
automatically semiregular. However, the converse of this statement is 
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false. For, in the semiring of non-negative rattoral nu:nbers 2 is semi- 
regular while not nilpotent.® 

THEOREM 7. Every nilpotent right or left ideal of a semiring S 1s contained 
in the Jacobson radical R of S. 

Proof: We let I be a nilpotent right ideal of degree n, that is J" = (OQ). 
We wish to prove that / is a right semiregular ideal. It can be easily veri- 
fied that, if we define 

2n — 1 
Oj - tye t = 1,2 


ji, jk = 1,2 


and 


{ 1 if j, = 1 for an even number of s, 


| 0 if 7, = 1 for an odd number of s, 


= 1 — .-- Je), 


then 4; + sy + a5) + tose = tg + So + tos; + 4452, where s; and so are in /. 
This implies that / is a right semiregular ideal and that J < R. 

Definition 7: The principal right ideal (v) is the set of elements {vs + 
in}, where s is an arbitrary element of the semiring S and n a non-negative 
integer. 

Corouiary. /f vis an element such that < R, thenv < R. 

Proof: If 1 is the principal right ideal (v), then J; < S/S. However, 
SvS = SIS. Trivially, SuS < S/S. An element of S/S is of the form 
+ vn)so = + syns, = Therefore, SIS < SvS 
and the result follows. Thus /* < SvS < R. Now (I + R)* = Si, + 
r1)(i2 + re) (is + 73) = + Yr. Therefore, (J + R)*< R. Wehave 
then that J = (J + R) — Risa nilpoint ideal of degree 3 in the semisimple 
semiring § = S — R. Theorem 7 states that ] < R = (0), by theorem 6. 
Therefore J < R, and, in particular, v € R. 

Definition 8: A semiring S is said to be regular, in the sense of von 
Neumann,’ if for every element s in S there exist elements x’ and x” in S 
such that sx’s = s + sx"s, 

We notice that in the case S is a ring, this definition reduces to the one 
given by von Neumann. 

THroremM 8. If R ts the radical of a regular semiring S, then for any ele- 
ment r in R there exists an element win R such thatr + w = w. 

Proof: We let re R, then rx’r = 7 + rx"r. Since x”r and x’r are in R, 
there exist elements and s. in R such that + sy + + = 
x'r + + (x’r)sy + (x"r)se. Therefore, + rs; + rx"rsy + rx'rs, = 
rx'r + + rx’rsy + rx"rso. Whence, rx"r + rs, + rx"rsy + + 
= + + rs, + + rx"rss. We define w = + 
rs; rx"rsy + + and obtain that r + w= w,we R. 


~ 
| | 
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We notice that in the case S is a regular ring, this theorem implies that S 
is a semisimple ring. 

5. The Jacobson Radical of a Matrix Semiring S,..-Lemma 4. If sy 
is right semiregular in the semiring S, then the matrix M = (sj), sy = 90, 
i > 1, 4s right semiregular in the matrix semiring Sy. 

Proof: Theorem 2 yields that s + s’ + sys’ = s” + sys”. Therefore, 
Sty + = 515” + We have, then, that M + M’ + MM’ 
= M” + MM", where M’ = (s,,’), sy’ = 0,4 > 1, and M’ = (s,,"), 
sy” = 0,4 > 1. Thus M is right semiregular in S,. 

Lemma 5. If Rts the Jacobson radical of a semiring S, then the right ideal 
T, of all matrices M = (ri), rye R, ry = 0,1 k, ts a right semiregular ideal 
in Sy. 

Proof: The fact that /, is a right ideal is obvious. We let M, = (rij) 
and M,* = (r,;*) be any two matrices of J,. We consider the pairs of 
elements (7;;, %;*), 7 = 1,2,...,m. Since and r,;* are in R, there exist 
elements in R such that ry; + Sey = + 
Sey + We define S; = (54), sy = 0,4 k and S,* = 
(sij*), = 0,4 AR. Then, we can easily verify that + + + 
M,*S,* = M,* + + M,*S, + M;S,*, where Si and are in Ty. We 
have proved that /, is a right semiregular ideal in S,. 

THEOREM 9. Jf R is the Jacobson radical of a semiring S with a unit 
element, then R,, is the Jacobson radical R(S,) of the matrix semiring S,. 

Proof: Now R, = Dot J,. Since Lemma 5 states that /, is a right semi- 
regular ideal, then R, is a right semiregular ideal by Lemma 1. Hence, 
R, < R(S,). 

We wish to prove that R(S,) < We let M = (sy)e R(S,) and 
Ty = (tim) lim = 0, 4, m ~ i, = 1. Then D= ve Sp] = 
TipMT » is contained in R(S,). 

We consider the totality of diagonal matrices of the form D = [d, ..., d] 
which are contained in R(S,). Then the set A of elements, which are 
contained in the first place of D, forms a right idealin S. If Dj = [d, ..., 
and D, = ..., are in R(S,), there exist matrices S,; and in 
R(S,) such that dD, + Si + DS; = Ds Se 
Using matrices of type 7), previously defined, we can modify this to read 
dD, + + DE, Doky = Dy + + + Dok, where ky = len", 
en*, ..., en*], k = 1,2. We have then that d; + en! + den! + deen? = 
dy + ey? + den? + d2e\', where e,,' and e,,* are in A. This implies that 
Aisa right semiregular ideal and A < R. This yields that s,,¢ Rand M < 
R,. Therefore, R(S,) < R, and R(S,) = R,. 


* Present address: University of Connecticut, Storrs, Conn. 
1 Vandiver, H. S., ‘‘Note on a Simple Type of Algebra in Which the Cancellation Law 
of Addition Does Not Hold,’’ Bull. Am. Math. Soc., 40, 920 (1934). 
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2 Vandiver, H. S., “On Some Simple Types of Serai-rings,” Am. Math. Monthly, 46, 22- 
26 (1939). 

* Jacobson, N., Theory of Rings, New York, 1943, p. 2. 

* Jacobson, N., “The Radical and Semi-simplicity for Arbitrary Rings,” Am. J. Math., 
67, 300-320 (1945). 

* Dubreil, P., “Contribution a la theorie des demi-groupes,”” Mem. Acad. Sct. Inst. 
France (2), 63, No. 3, 52 (1941). 

® McCoy, N. H., Rings and Ideals, Math. Assce. Am., Buffalo, 1948, p. 137. 
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REPRESENTATIONS OF SEMISIMPLE LIE GROUPS ON A 
BANACH SPACE 


By HARISH-CHANDRA 
DEPARTMENT OF MATHEMATICS, COLUMBIA UNIVERSITY 
Communicated by O. Zariski, January 26, 1951 


Let G be a connected semisimple Lie group and § a Banach space. By 
a tepresentation of G on § we mean a mapping w which assigns to every 
x €G a bounded linear operator (x) on § such that the following two con- 
ditions are fulfilled: 


(1) (xy) = w(x)r(y) and r(1) = J (here | is the unit element of G 
and J the identity operator on ). 

(2) The mapping (x, G, is a continuous mapping 
of G X Hinto H. 

The object of this note is to announce a few theorems on these representa- 
tions. No attempt is made to give proofs here. A detailed account with 
complete proofs will appear elsewhere in another paper. 

Let R and C, respectively, be the fields of real and complex numbers. 
Let go be the Lie algebra of G and g the complexification of go. We denote 
the universal enveloping! algebra of g by B. Let C,°(G) be the set of all 
complex-valued functions on G which are indefinitely differentiable every- 
where and which vanish outside a compact set. Let m be a representation 
of Gon § and V the set of all elements in § which can be written as finite 
linear combinations of elements of the form 


SS dx (eH, fe C.°(G)) 
where dx is the left invariant Haar measure on G.  V is called the Garding 


subspace’ of § (with respect to r). For every X € g we can define a 
linear transformation my(X) of V into itself such that 


= tim (x(exptX) —I}¥ te R). 
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The mapping X — ry(X) is a representation of go and therefore it can be 
extended uniquely to a representation ry of Bon V’. Let Ye V. Consider 
ry(B)y where the bar denotes closure in . It turns out that in general 


ry(B)y is not invariant under #(G). In order to avoid this unpleasant 
state of affairs we replace the Garding subspace V by the space of all well- 
behaved elements. This is defined as follows. Let U be a subspace of 
(not necessarily closed). We say that l’ is well-behaved under 7 if the 
following conditions hold. 

(1) There exists a representation ry of go on U such that 


= Lim | r(exp tX) — (X € Qo, Pe U). 


(2) For any continuous linear function f on § and y ¢ U the function 
f(x(x)p) eG) 


isan analytic function onG,  Itis clear that if U;, two well-behaved 
subspaces of § then U; + U, is also well-behaved. From this it follows 
that the union W of all well-behaved subspaces in § is itself a well-behaved 
subspace. An element will be called well-behaved if « W and W 
is called the space of all well-behaved elements. It is clear that the map- 
ping X — mw(X) (X € Qo) can be extended uniquely to a representation 
mw of Bon W’. The following theorem justifies the notion of well-behaved 
elements. 

THEOREM 1. Let ¥ be a weil-behaved element of S. Then mw(B)y is in- 
variant under x(G). 

Let G* be the adjoint group of G and let A* be a maximal compact sub- 
group of G*.. Let A be the complete inverse image of A* in G. Then K 
is connected though it is not necessarily compact. Let P be the set of all 
equivalence classes of finite-dimensional irreducible representations of K. 
Let De PandyeH. We say that y transforms under #(K) according to T 
if the space U spanned by x(u)y for all uw ¢ K is finite-dimensional and the 
representation of K induced on U is fully reducible into a direct sum of ir- 
reducible components each of which lies in D. Let Op be the set of all 
elements in § which transform under #(K) according to D. Put Wp = 
WA Gp. Let Z be the center of G and 3 the center of B. Let V be the 
G4rding subspace of § and wy the representation of 8 on V as defined 
above. We shall say that x is a quasi-simple representation of G if there 
exist homomorphisms ¢ and x of Z and 3 respectively, into C such that the 
following two conditions hold: 


(1) (dé Z). 


| 
| 
| 
= 
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In case condition (2) is satisfied we say that m has the character x. 
THEoreM 2. Let x be a quasi-simple representation of G on $. Then 
> Wp ts dense in § and Hp = Wp (Be P). 

DeP 
Here >> Wp is the space consisting of all finite linear combinations of 
DeP 


clementsin VU Wp. 
DeP 


TuHrorem 3. Let be quasi-simple and let Wp. Put U = 
DeP 


rw(B)y and Up = U Op(DeP). Then = Upand 


DeP 
dim L'y < @ for every D ¢ P. 

Let V be two subspaces of ©. We write U > Vor V < UifU2 V 
and U # V. Let U and V be two closed subspaces of § invariant under 
n(G). We shall say that V is maximal in LU if U > V and there exists no 
closed invariant subspace U, such that U > U; > V. 

THroreM 4. Let w be a quasi-simple representation of G on a Banach 
space § # {0}. Then there exist two closed invariant subspaces U and V 
in $ such that V is maximal in U. 

Now we shall consider the special case when is a Hilbert space and + 
is a unitary representation, From Theorem 4 we deduce the following 
result. 

THEOREM 5. Let m be a quasi-simple uniiary representation of G on a 
Hilbert space § # {0}. Then there exists a closed invariant subspace U of 
§ such that {0} is maximal in U (i.e., U > {0} and U is irreducible under 
m(G)). 

The above theorem has the following significance in relation to the theory 
of factors of Murray and von Neumann.’ Let be a unitary representa- 
tion of G on a Hilbert space §. Let % be the smallest weakty closed alge- 
bra of bounded operators on § containing r(G). It is known‘ that if 
is a factor, i.e., if the center of YU consists of scalar multiples of I, then x 
is quasi-simple. Therefore in this case Theorem 5 is applicable and so it 
follows from the results of Murray and von Neumann? that ¥%f must be a 
factor of type /, or J... This shows that factors of Type II and Type III 
cannot arise from a unitary representation of a semisimple Lie group on a 
Hilbert space. For the consequences of this result we refer the reader to 
the above-quoted paper of Mautner. 

Let De P. We say that D occurs in if Sp {O}. It is known that 
all unitary irreducible representations of G are quasi-simple. 

Turorem 6. Let x be a homomorphism of 3 into C such that x(1) = 1 
and let De P. Let E(x, D) denote the set of all representations x of G which 
have the following properties: 

(1) 4s an irreducible unitary representation of G on some [Hilbert space. 

(2) D occurs in x. 


| 
| 
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(3) mhas the character x. 

Then &(x, D) contains only a finite sinnies of inequivilent representations 

Let d(D) denote the degree of any representation in D. Suppose d(D) = 
1, re (x, D) and OH is the representation space of x. Then it can be shown 
that dim H; = | and the maximum number of inequivalent representations 
in G(x, D) is not greater than the order of the Weyl group of g with respect 
to some Cartan subalgebra. Moreover if G is a complex semi-simple Lie 
group all representations in (x, D) are equivalent (provided d(D) = 1). 

Let be an irreducible unitary representation of G on a Hilbert space 
©. For any D « P let Hp denote the orthogonal projection of § on Dr. 
Since 7 is irreducible and unitary it is quasi-simple and therefore it follows 
from Theorem 4 that dim 9: < ©. Hence the function gp"(x) = 
Sp(Ep m(x)Ep)(x € G) is well defined. It is an analytic function on G 
and gp"(uxu~') = gp"(x) (we K). 

THEOREM 7. Let m, and mz be irreducible unitary representations of G on 
two Hilbert spaces. Let P., be the set of all elements in P which occur in m. 


Suppose that for some and ce C, 
ep = 
Then x, and m, are equivalent. Conversely if m, and m, are equivalent 
yp” = yn": 
for all De P. 

In case d(D) = | the function gp" is the same as the spherical function 
introduced by Gelfand and Naimark.® Now suppose G is a complex semi- 
simple Lie group and d(D) = 1. Then we have seen above that the repre: 
sentation m is completely determined within equivalence by its character x. 
Actually in this case an explicit formula for gp" in terms of x can be ob- 
tained quite easily. This formula is very similar to the one given by Gel- 
fand and Naimark® for representations of the principal series. 


' See, for example, Harish-Chandra, Ann. Math., 50, 900-915 (1949). 

* Garding, L., Proc. Nati. Acap. Scr., 33, 331-332 (1947). 

§ Murray, F. J., and von Neumann, J., Ann. Math., 37, 116-229 (1936). 

* See Mautner, F. 1., Ann. Math., 52, 528-556 (1950). 

5 Gelfand, I. M., and Naimark, M. A., Doklady Akad. Nauk SSR (N. S.), 63, 22! 
(1948). 
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A NON-REFLEXIVE BANACH SPACE ISOMETRIC WITH ITS 
SECOND CONJUGATE SPACE 


By Ropert C. JAMES 


UNIVERSITY OF CALIFORNIA 
Communicated by J. von Neumann, December 5, 1950 


A Banach space B is isometric with a subspace of its second conjugate 
space B** under the ‘natural mapping’’ for which the element of B** 
which corresponds to the element x, of B is the linear functional Fr, de- 
fined by Fz,(f) = f(xo) for each f of B*. If every F of B** is of this form, 
then B is said to be reflexive and B is isometric with B** under this natural 
mapping. The purpose of this note is to show that B can be isometric 
with B** without being reflexive. The example given to show this is a 
space isomorphic with a Banach space known to not be reflexive, but to 
be isomorphic with its second conjugate space.! . 

A sequence {x"} of elements of a Banach space B is said to be a basis 
for B if for each x of B there is a unique sequence of numbers {a,} such 


that x = Sa! in the sense that Jim lix — Daal = 0. A fundamental 


sequence Por is a basis if and only i if haves is a positive number € such that 


n+p n 

|S > for any positive integers n and p and numbers 
1 1 

ja;}.2 Ife = 1, the basis will be called an orthogonal basis. But for any 


n 
basis {x"}, = for x = defines a new norm ||| {|| 
1 1 


which is equivalent to |! || and for which {x"} is an orthogonal basis.* 
Hence if B has a basis {x"} for which lim ||f|!, = 0 for each f of B*, where 


|f\|, is the norm of fon x"*+! @x"t? ® ..., then the following theorem de- 
scribes B** completely if the basis is orthogonal and describes B** to 
within an isomorphism if the basis is not an orthogonal basis. 
Tueorem. Let B be a Banach space with an orthogonal basis {2"| for 
which lim \\f\|, = 0 for each f of B*, where ||f\|, is the norm of f on 2"*! ® 


2"*2@.... Then |g") is a basis for B* if g"(z™) = 5), for each n and m. 
If F B**, then \|F\| = lim 2‘\|, where = F(g'). If the sequence 
n—> 1 


{F,| is such that lim ||SOF 2‘\| < + ©, then F ¢ B** if one defines F(f) = 
n—> 1 


fi for each f = Lifg' of B*. 
Proof: It has been previously known that {g"} is a basis for B*.4 It 


j 
{ 
/ 
| 
n 
oo @ 
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follows from this that F(f) = LPs for each F of B** and each f = Sf 
of B*, where F, = F(g'). But, for each f = | 
[Fe For a fixed n, let u" = DFat Define a linear functional h by 


A(z‘) = Ofori > mand h(u") = |\u"\|. Then |h(au* + = 
n+l 
< |lau" + az'‘||. Thus = 1 on @2"t? @®.... 


Extend h to all of B so that ||h|| = 1 on B. Then, for this h, h = hyg' 
T 


with h, = Ofori > n, so that = = |h(u")| = < || FI). 
1 1 
Since this can be done for each n, it follows that || | > Fel for each n 


n 
and ||F| > lim ||SOFz‘!. It has thus been shown that lim (Dre = 


|F\| for each element F = {F,} of B**. Now suppose me {F, ' is a se- 
quence such that lim M < +o. Then |! < 2M. 


Thus for any fixed B*, | Fifi = |f( F.z')| < ||f\|,(2M), so that 


it follows from lim |! f), = 0 that oF f, is convergent. Thus F(f) = 


© 


is defined for each f B* and ||F\| lim 
1 


n—> 


Example: For x = (x1, X2, Xs, ), let 


where the 1. u. b. is over all positive integers n and all finite increasing sequences 
of at least two positive integers pi, Po, ..., Pri. Let B be the Banach space 
of all x for which |\x'\ is finiteand lim x, = 0. Then B is isometric with 
no 
B**, but 1s isometric under the natural mapping with a closed maximal linear 
subspace of B**. 
Proof: For x = (x1, 2, ...), let 


n 


where the 1. u. b. is over all positive integers m and finite increasing se- 


| 
/ 
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quences of positive integers pi, pr, ..., Panga. It followsirom lim x, = 


Qand > |x, — x,| that > |x,| for each p. Clearly |||x||| > |x,| for 
each p. But by grouping alternating terms of (1) and isolating x,,, one 
gets <1. tx», | + + Xp,)* + (Xp, _3 + 

extra terms can be introduced in (2) to give a sum of type (1), except for 
replacing (xp...) DY — | < Since 
< ||x!| < 3]||x ||, these two norms are equivalent. But the Banach space 
of all x = (x1, %, ...) for which lim x, = 0 and |||x/|| is finite is known 


to not be reflexive, but to be isometric under the natural mapping with a 
closed maximal linear subspace of its second conjugate space.' Hence 
' this is also true of the space B. , 
Let 2" = (0,0, ...,0, 1,0, ...) be the element of B whose components 
are all zero except for the nth component, which is 1. Then z! @ 2@. 


= B, so that {z"} is an orthogonal basis for B if Dae" 4 cy bz'|| > 


|| for all numbers {a,{ and {4,} and positive n p. Since 


Sve 2’ has only a finite number of non-zero components, a sequence /p;, Pp», 
1 


Can be chosen so that 


n k 
= [= (ap, ~ Ap + (3) 
i i=l 


where Gr = Oifr = n. If peyi <n, then it is immediate from (1) and (3) 
n 

that Dae! + > If peu > mn, then each p, with 


Pp, > ncan be wana by some p; > n + p without changing the value of 


(3), since a, = Oifr > n. But it wili then again follow from (1) and (3) 
n+p 


that > For B with the norm ||| |/|, and 
1 
hence i for B with the norm |! ||, it is known that lim |/f\|, = 


where || is the norm of f on 2"*! 2"*? ....1. Hence by Theorem 1 
above, B** is the space of all F = (Fi, Fy, ...) for which | F)) = lim 


n> 


n 
| $°F 2"! is finite. Thus for F to belong to B**, itis necessary that lim F, 
1 


exist. Consider the correspondence: 


...)= 

(Fi, Fa, )= F,. 
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n 
To show that = |/F;||, first consider a sum [= (%», — + 


i=l 


If p, = 2, this is equal to [35% — + 


n 
(Feat If p, = 1, it is equal to — F,,,,-1)* + 


Fy)? (Fy if N> Pn+1 — land Fy is replaced 


n 
by zero. Since |!}>F,z‘) is a monotonically increasing function of n, it 
1 
n 
follows that ||x!| < |! F;||, where |! F,|| = lim Now consider a sum 
n> 1 


— + — Fp,)? |, formed for the element >) Fis‘, 


1 
where F, is to be replaced by Oif p > m. If Pasi < m, then this sum is 


equal to [= + 1)? |. Now suppose 
1= 
k-1 
that > m, but < mifi < k. Then the sum becomes [= (F,, 
k-1 


+ (F,,)? + [= (Xp — Xpitt + 1)? + (Xp, + 


+ (m Thus ||x!| > for each Hence = || 


and x + F, is an isometry with domain equal to B. But if F = (Fi, 
F,, ...) is an element of B**, and lim F, = L, then xe = (—L, Fi — L, 


F, — L, ...) is, by the above, an element of B for which |x| = || F\! and 
xp +> F. Thus the range of the isometry is B**. 


1 James, R. C., “Bases and Reflexivity of Banach Spaces,” Ann. Math., 52, 518-527 
(1950). 

2 Grinblum, M. M., “Certain théorémes sur la base dans un espace du type (B),” C. R, 
(Doklady) Acad. Sci. URSS (N. §.), 31, 428-432 (1941). 

3 Banach, S., Théorie des Opérations Linéaires, Warsaw, 1932, p. 111. 

4 James, loc. ctt., Theorem 3. 
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PROBABILISTIC THEORIES OF RELATIONS 


By MENGER 
ILLINOIS INSTITUTE OF TECHNOLOGY 
Communicated by Marston Morse, January 29, 1951 


Poincaré repeatedly emphasized that only in the mathematical con- 
tinuum the equalities A = Band B = C imply the equality A = C. In 
the observable physical continuum, ‘‘equal’’ means ‘‘indistinguishable,” 
and A = Band B = C by no means imply A = C. ‘The raw result of 
experience may be expressed by the relation 


A=B,B=C,A<C 


which may be regarded as the formula for the physical continuum.” Ac- 
cording to Poincaré, physical equality is a non-transitive relation. 

A closer examination of the physical continuum suggests that in de- 
scribing our observations we should sacrifice more than the transitivity 
of equality. We should give up the assumption that equality is a relation. 
For this assumption implies that for every two elements, A and B, the 
question as to whether or not A and B can be distinguished, is inalterably 
settled. But simple experiments show that, for instance, the simultaneous 
irritation of the same two spots, A and B, on the skin sometimes produces 
one sensation, sometimes two. Only by reliance on the majority of the 
impressions, by processes of averaging and the formation of means, equality 
relations have been artificially created. 

We obtain a more realistic theoretical description of the equality of two 
elements by associating with A and B a number, namely, the probability of 
finding A and B indistingutshable. \n applications, this number would be 
represented by the relative frequency of the cases in which A and B are not 
distinguished. 

In principle, this idea solves Poincaré’s paradox. For if it is only very 
likely that A and B are equal, and very likely that B and C are equal, why 
should it not be less likely that A and C are equal? In fact, why should 
the equality of A and C not be less likely than the inequality of A and C? 

If & (a, 6) denotes the probability that a and b be equal, the following 
postulates seem to be rather natural: 


(1) E(a,a) = 1 for every a; 
(2) E(a, b) = E(b, a) for every a and b; 
(3) K(a, b)-E(b, c) < E(a, c) for every a, b, c. 


(1) and (2) correspond to the reflexivity and symmetry of the equality 
relation, (3) expresses a minimum of transitivity. 
If we call a and b certainly-equal provided that E(a, b) = 1, we obtain 
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an equality relation. All the elements which are certainly-equal to a may 
be united in an “‘equality-set,’’ A. Any two such sets are disjoint unless 
they are identical. We may define /(A, B) as the probability that any 
element of A and any element of B be equal. For this number is inde- 
pendent of the particular choice of the two elements. 

If we set —log H(A, B) = d(A, B), then we readily conclude: 


(1,’) d(A, A) = 0; (1’) d(A, B) > 0; (1.’) d(A, B) + Oif A + B; 
(2’) d(A, B) = d(B, A); 
(3’) d(A, B) + d(B, C) > d(A, C). 


These are Fréchet’s postulates for the distance in a metric space. In 
particular (3’) is the triangle inequality. Conversely, if disjoint sets A, B,... 
form a metric space with the distance d(A, B) and we set £(A, B) = E(a, b) 
= e 44. ® for each element a of A, and bof B, then E(a, d) satisfies the pos- 
tulates (1), (2), (3), of a probability of equality. The systems of probabili- 
ties of equality in a set, S, are thus identical with the systems of negative anti- 
logarithms of the distance for the various possible metrizations of S. 

If S is a straight line, (a, b) = e~” ~ “ is the probability of equality 
corresponding to the euclidean metrization of S. Another example is 
E(a, b) = while the smooth function ~ is ruled out by the 
fact that the corresponding distance d(a, b) = (b — a)?* fails to satisfy the 
triangle inequality. 

Since d(a, b) < , for every two points of a metric space, it follows that 
K(a, b) > 0. We may find it more desirable to assume that every element 
6 differing from a by more than a certain exterior threshold be certainly 
distinguishable from a. But then we have to give up even the minimum of 
transitivity expressed in Postulate (3). 

In the same way as we have just treated equality we may treat the order 
relation ‘‘a precedes 6.’ If we introduce the probability P(a, ) of a pre- 
ceding 6, the following postulates are natural. 

Monotony. If a, < a, < by < by, then Plas, bs) < Play, dy). 

Boundary Conditions. lim P(a,y) = lim P(x, b) = 1. 


The following assumptions reflect properties of the order relation: 
Asymmetry. P(a, b) + P(b,a) = 1. 

Transitivity. P(a, b)-P(b,c) < Pla, c). 

An example of a function satisfying the above postulates is 


a)/2 if b < a 


P(a, 6) = 


forany k > |. Besides, this function satisfies the following condition of 
Homogeneity. P(a,b) = Pla’, b') it b’ a’ = b—a. 
Moreover, the example illustrates the following general fact. If we 


/ 
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wish to satisfy the transitivity postulate we must admit that every number, 
even a very large one, has a positive probability of preceding 0. 

It is clear that probabilistic theories of dyadic, triadic, and higher rela- 
tions can be developed which are based on functions of 2, 3, and more 
variables. This procedure seems to be a natural extension of the idea of re- 
placing qualities by quantities (monadic relations by functions of one vari- 
able). 

But the probabilistic point of view far transcends the domain of relations. 
It is applicable to functions. Instead of a real function of one, two or n 
variables, we may consider the association of a distribution function with 
every element, pair or n-tuple of elements of a set. Of particular geometric 
importance is the probabilistic generalization of the concept of a metric 
space.,! 

* An attempt in this direction was made by the author in the note ‘Statistical Met 
rics,”’ Proc, Nati. Acap. Sct., 28, 535-537 (1942). 


HARMONIC FORMS AND HEAT CONDUCTION 
1; CLOSED RIEMANNIAN MANIFOLDS 
By A. N. MILGRAM AND P. C. ROSENBLOOM 
DEPARTMENT OF MATHEMATICS, SYRACUSE UNIVERSITY 
Communicated by Marston Morse, December 15, 1950 


This paper is concerned with properties of exterior differential forms 
defined on a Riemannian manifold V, of class C’, r > 5. The existence 
and uniqueness of harmonic forms having prescribed periods has for some 
time been known.' ® In this note an analog to the heat equation for 
exterior differential forms is studied and the theory of harmonic forms 
follows as a limiting case analogous to the steady state in heat flow. We 
restrict our considerations in the present note to closed orientable manifolds 
reserving for future publication the adaptation of our method to open 
manifolds. 

In the following we shall adopt the notation used in reference 3 with the 
following minor modification: da = (—1)"?*"(*d*a), where p is the degree 
of a. Then Sa = (dé + éd)a is merely the negative of the operator used 
in reference 5, and the analog of Green's formula has the following form: 
ida, 8) = 68). The symbol! a! will denote the norm, where |! a! = 


A i 
V (a, a). 


If a is a form whose coefficients depend on a parameter t, by i we 


mean the form obtained by replacing each coetticient of @ by its partial 
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derivative with respect to ¢. Our principal concern will be with the 


existence and behavior of solutions of the equations 


Oa 
La = — = 1) 
a a or 0 ( 


(1’) 


where 8 is a given form. 


Let L; = A+ a the operator adjoint to L. Using (a, Ag) — (2, 


Aa) = 0, we remark that 
ra) 
(118, a) (p, L a) at (a, 8). (2) 


We now define a parametrix adapted to the operator L. Let r denote 
a distance function (cf. 3); let 


By = DA 


where A,j,...i9, = det Ay = and A = —r*/2; and let 


ox'dw 


On replacing 8 in (2) by w,(x, y, ¢ — 7) and regarding x and ¢ as param- 
eters in the differentiations and integrations, we obtain for 0 < h< t 


(Liw,(x, y, t — 7), aly, 7)) — (w(x, — 7), L aly, = 
(aly, t — h), w(x, vy, — (aly, 0), y, (4) 


We now let 4 approach zero. 

At this point we need the following lemuinas: 

Lemma 1. The norm of Lyw,(x, y, t) ts bounded by a constant independent 
of x and t. 


Lemma 2. /f ats continuous, then lim (aly, t), Wp(x, h)) = a(x, t). 
h—>0 


The proofs of these lemmas involve the use of osculating Euclidean 
coordinate systems as in reference 3 combined with the standard techniques 
given in references 6 and 7. Actually by these simple computations it is 
shown that the kernel L,w,(x, y, 4 — 7) yields a bounded operator in all the 
classical norms. 

Thus for twice differentiable forms @ we arrive at 


Ss! (Liw,(x, 9, — 7), aly, 7)) — (w(x, ¥, 7), L aly, fdr = 
a(x, t) (aly, 0), w(x, (5) 


i 
and 
La = | 
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Hence if ais a solution of (1’), then a satisfies the Volterra integral equation 


+ Si'(Liw,, ajdr, (6) 
where 


= (aly, 0), w,(x, y, — So'(w,(x, y, — 7), Bly, t))dr (7) 


THeorReM |. 0) and are given forms of class C, and 1s defined 
hy (7), then the integral equation (6) has a unique solution, which is of class 
C’ for t > Oand continuous fort > 0. This solution of (6) is also the unique 
solution of (1') taking on the prescribed initial value a, = a(x, 0) fort = 0. 

The existence and uniqueness of the solution of (6) follows trivially by 
the method of successive approximations, as in the standard theory of 
Volterra’s integral equations. The regularity properties of the solution 
of (6) may be proved directly from the integral equation or by examination 
of the resolvent kernel, which is, of course, the fundamental solution of the 
heat equation. This theorem plays a central role, and dispenses with the 
much more complicated machinery based on the Fredholm theory, which 
was used by previous authors.' * 

Let 7a denote the solution of (1) having initial value a(x) at t = 0. 
Then since (1) is invariant under translations with respect to ¢ and by the 
uniqueness of the solution of (1) it follows that {7,} isa semigroup; i.e., 
= (ef. 8). Moreover if g(t, 7) = Taz), then 


g(t, T) os (2 Ti, (AT = (Ta, AT, a2) 


) 
Tm, — T, axe t, 7), 
so that g(t, r) is a function of ¢ + 7. Observe that this implies 7; is a 
symmetric opérator, for setting r = O gives a2) = (a1, Also 


— = + || — AT una, = 
|| + \| = ?. 


But is a non-negative, non-increasing function of for 


Ta) = 2 (2 Tae Tra) = 2 Te) 


d 
—2(dT dT,a) — a, < 0. 


Hence not only does the limit of | T ,a'\| exist as 1 —> © but also 7',@ con- 
verges in the mean to a form //a. From Lemma | and the relation 
Tiana = = we see that convergence in the mean implies 
uniform convergence and IIa = 7TJ/a which by Theorem | shows that 


Hla is of class C’, and that A//a = = Ila = 0, so that //a is a harmonic 


| 
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form. 

= 
o¢ 
(1), then dais also. Hence if a(x, 0) is closed da is a solution of (1) having 
initial value 0, and by the uniqueness theorem we have da = 0 for all ¢. 
We conclude that if a form is closed initially, it remains closed for all time. 


Noting that Ad = dA,d d it follows that if a(x, t) isa solution of 


If a is a solution of (1), then 


3 
fa-f sa = ta 
at ( ac 


where C is any p chain in V,, and OC denotes the boundary of C. If a@ is 
closed and C is a cycle, the last two terms vanish so that f(a is independent 
of t, and therefore also = Ha. 

THEOREM 2. [f ats a solution of (1) which ts closed for t = O, then it 1s 
closed for all time. Its periods remain constant, and as t~* © a converges 
uniformly to a harmonic form Ha having the same periods. 

This shows that Hodge’s main theorem on the existence of harmonic 
forms with prescribed periods follows immediately from de Rham's 
theorem’ on the existence of closed forms with prescribed periods. (We 
recently learned in conversations with B. Eckmann and W. Hurewicz 
that they have found independently very simple proofs of the latter 
theorem, so that combining their work with the present note provides a 
new approach to the whole theory.) 

We now consider the variational problem of finding for a fixed ¢ > 0, 
the maximum of || 7,a'| under the conditions a) = 1, Za = 0. From the 
integral representation of 7°, in terms of the resolvent kernel of (6), we see 
immediately that 7°, is a completely continuous operator, so that this 
maximum is actually attained. If a(t) is an extremal form for this problem 
and X(t) is the corresponding maximum, then a standard argument shows 
that 7>,a@ = A(t)*a. Straightforward considerations now show that 
A(t) = e * where A, is a certain positive constant. ; 

An elementary calculation shows that the solution of (1) with the 
initial value a is given by the formula 


a = Tia — 
If 8 is independent of ¢, then this takes the simpler form 
a = Ta — 
But — Hpl| < 
which is integrable over [0, ~ |, so that 


lim (a + = y 


1833 
(S) 
(S’) 
(9) 
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exists uniformly, and it is readily seen that 
Ay = 6 — Hp. (10) 


Equation (10) implies the fundamental decomposition theorem of Hodge. 

The operators U, = é(7, — H) form a semigroup with properties 
similar to those of {7',}, and these converge, as t— ©, toa projection //, 
onto the solutions of Aa = — \ya. By repeating this procedure we obtain 
the spectral resolution of 7, in the form 


T,= H+ > 
k=1 


where {dy} are the eigenvalues of the equation Aa = — Aa. The complete- 
ness of the eigenfunctions is quite easy to prove directly. 

We note that another generalization of the Laplace operator was intro- 
duced in reference 10, and also that other treatments of parabolic equations 
occur in references 11 and 12. 
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NATIONAL ACADEMY OF SCIENCES CONFERENCE ON COM- 
PLEMENT 


By MICHAEL HEIDELBERGER 
COLLEGE OF PHYSICIANS & SURGEONS, COLUMBIA UNIVERSITY 


Communicated January 19, 1951 


In the spring of 1950 the writer undertook to organize a conference, to be 
sponsored by the National Academy of Sciences, on some phase of immunol- 
ogy. Asa result of discussions on suitable topics it was decided to assem- 
ble a group, with a wide diversity of interests, in order to throw light from 
all sides on the subject of complement. 

The reasons for the choice of topic were as follows: 

Following the introduction of rigorous, quantitative analytical micro- 
methods for the estimation, in weight units, of antigens and antibodies the 
part played by these substances and their modes of interaction in immunity 
to infectious disease became greatly clarified.'| Much progress has resulted 
from the application of similar methods to the study of complement,? 
which, next to antigens and antibodies, is the most important auxiliary 
substance concerned with immunity. However, the complexity of com- 
plement and its associated phenomena frequently made difficult the inter- 
pretation of analytical data. 

Often the mere combination of antibodies with the infectious agent is 
insufficient to set in motion the biological events which terminate a disease. 
It is under these circumstances that complement plays its most important 
role, for it promotes phagocytosis of infectious agents which have combined 
with antibody and in many instances it mediates their dissolution, or lysis. 
Not only is complement of great interest in this connection, but also be- 
cause of its enormous use in diagnostic tests such as the Bordet-Wasser- 
mann test for syphilis and numerous tests for viruses which are pathogenic 
for man and animals. The mechanism of complement uptake or “‘fixation’’ 
in these commonly used and highly crucial tests is still poorly understood, 
as is also the mechanism of the lysis (hemolysis, in this case) of the red cell- 
anti-red cell complex used as “indicator” for the presence of complement in 
the tests. 

A conference on complement was accordingly held at Ram’s Head Inn, 
Shelter Island, N. Y., from June 12 to 15, 1950. Those who participated 
were: A. R. Dochez, H. Eagle, F. Haurowitz, M. Heidelberger, C. W. 
Hiatt, J. F. Kent, M. H. Loveless, D. A. MacInnes, C. M. MacLeod, F. 
Maltaner, M. M. Mayer, $. Ochoa, A. G. Osler, A. M. Pappenheimer, Jr., 
L. Pillemer, E. Ponder, V. R. Potter, C. E. Rice, W. H. Seegers, and A. J. 
Weil. O. T. Avery, E. E. Ecker, J. W. Fisher and S. B. Hooker were 
unable to attend. 
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The topics of the tentative program were: (1) Chemistry of complement 
and its components. (2) Complement fixation and its relation to antigen- 
antibody reactions, including neutralization of viruses. (3) Immune hemol- 
ysis: kinetics, mechanism, relation to enzymatic processes. (4) Bacteriol- 
ysis; enhancement of phagocytosis, immobilization of spirochetes. (5) 
Complement and the coagulation of the blood. 

Actual discussion developed somewhat as follows: 

1. Definition of Complement (C’).—-Particularly for the benefit of those 
present who had not actually worked with C’ an attempt was made to 
define C’ but agreement could not be reached. This was mainly because 
independent, functionally different methods of measurement lead to con- 
flicting results. For example, the diagnostically important “‘fixation’’ of 
C’, that is, its uptake by the interaction of antigen and antibody, is meas- 
urable as a first approximation by the increase in weight of the antigen- 
antibody complex.? The values so obtained, however, do not run parallel 
to the decrease observed in the hemolytic power of C’,’ the biological prop- 
erty of C’ actually measured in the diagnostic test. The importance was 
emphasized of studies now under way to resolve this discrepancy. It was, 
however, generally agreed that Muir's old definition, ‘‘a fixable constituent 
of fresh serum’’* required supplementation by something like “‘C’ is a com- 
plex of serum constituents which brings about the lysis of red cells, immo- 
bilization of spirochetes, the killing of certain Gram-negative bacteria and 
the opsonization of certain bacteria, all of which functions usually require 
the cooperation of antibodies to the cells involved.’ Other activities and 
manifestations of C’ were discussed but not included in the definition be- 
cause of lack of understanding of the underlying phenomena. 

2. Chemistry of C’.—-Pillemer gave a brief résumé. Of the four com- 
ponents of C’ (C’1, C’2, C'3, C’4) only C’l has thus far been isolated, from 
guinea pig C’, in reasonably pure form as judged by electrophoretic and 
ultracentrifugal analysis.° This work has had far-reaching consequences 
and the urgency of the isolation of the other components in a state of 
purity was discussed by the group. Heidelberger told of the separation 
of C’3 as a single activity from pig C’, which is usually characterized by a 
titer of C’S tenfold that of C’3 in guinea pig C’. 

3. Measurement of C’ by Increase in Weight of Specific Precipitates. 
The technique and limitations of the method were outlined by Heidelber- 
ger. Guinea pig, human, bovine and pig complements all give values of 
C’ nitrogen between 0.04 and 0.06 mg. per ml. when volumes of C’ used are 
extrapolated to 0. No further light was thrown on the ‘solubility effect” 
causing lower values per ml. with larger volumes of C’. It was generally 
agreed that most of the weight added by C’ is derived from C’1. 

4. Complement Fixation as Studied by Measurements of Hemolysis.— 
Recent quantitative studies were described by Osler. While these lead to 
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curves of C’ uptake greatly resembling those of specific precipitation at 
far higher concentrations of antigen and antibody, the maximum C’ uptake 
indicated by the residual hemolysis may be much greater than can be ac- 
counted for on a weight basis if an attempt is made to correlate C’ weights 
and hemolytic titers. Heidelberger presented evidence indicating that be- 
cause of preferential fixation of C’l and C’4 from guinea pig C’ measure- 
ments made before and after fixation might not be comparable owing to a 
shift in the limiting components, i.e., those present in lowest titer. These 
are always C’2 and C’3 in whole guinea pig C’. It was decided that further 
work was necessary for general acceptance of this view and that truly quan- 
titative methods for the measurement of component activities were needed 
since the qualitative methods now available involve large uncertainties. 
Moreover, fixation of C’, as pointed out by Mayer, might be accompanied 
by an inactivation of C’ as well as a loading of C’ molecules onto antigen- 
antibody aggregates. 

5. Mechanism of Hemolysis by Antibody and C’.—-Mayer reviewed some 
of the older work which suggests that the hemolytic process may be enzy- 
mic. Two sets of observations stand out: Mg** is an essential ion® and as 
few as 50 molecules of antibody and 20,000 molecules of C’ (a highly uncer- 
tain value) are required for lysis of one cell, contrasted with about 10° mole- 
cules of a surface-active agent, such as saponin. He then described recent 
kinetic studies’ which suggest that hemolytic antibody is the enzyme and 
that C’ is an accessory factor used up in the process, perhaps as an energy 
donor. The experiments have indicated that the antibody-red cell union 
is reversible, making possible transfer of antibody from cell to cell. This 
finding is compatible with the view that the antibody is the enzyme, but is 
not compelling since some simple lysins can undergo transfer as pointed 
out by Ponder. Mayer has found that with complement in excess, and 
with antibody the limiting factor, the velocity of lysis is inversely propor- 
tional to the total cell concentration (lysed and unlysed). On the other 
hand, in a system with limited complement and excess antibody, the 
velocity of lysis is independent of the cell concentration. It is assumed 
that the lysis of a red cell results from damage to its surface structure and 
that a cell lyses when damage becomes critical. Therefore, the maximal 
rate of lysis should be directly proportional to the rate of production of 
damage (or strain) at the surface of the cell. (This is an approximation; 
strict analysis involves probability theory). From the rate of damage pro- 
dS 


, ) me obtains the rate of damage produc- 
tion per cell by dividing by the total number of cells; the maximal rate of 
dk 1 

= where = total cells and 
dt (dt) is 
FE = unlysed cells. Accordingly, if antibody is considered to be the en- 


duction in the whole system ( 


lysis would then be: — 
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zyme, and conditions are such that antibody is limited (i.e., substrate is in 
excess), the enzyme is working at top speed; therefore, variations of sub- 
strate concentration (i.¢., cells) do not affect the rate at which damage is 
produced, but do affect the maximal rate of lysis in inverse proportion. 
On the other hand, when antibody as the enzyme is in excess, the rate of 
damage production will be proportional to /, and therefore the maximal 
rate of lysis will be independent of the cell concentration. Since these 
kinetic deductions are verified by experimental results obtained with hemo- 
lytic antisera exhibiting relatively rapid transfer of antibody from cell to 
cell, they constitute an argument that the antibody acts like an enzyme, 
and complement is a substance or factor which is used up. 

After returning to the University of Wisconsin, Potter discussed the 
kinetics of hemolysis with Alberty, who had worked out a mathematical 
treatment of a somewhat analogous but simpler situation involving the 
reactivity of eamino groups on proteins. The discussion resulted in the 
formulation of a theory of immune hemolysis by Alberty and Baldwin. 
This will probably be published after the conclusion of correspondence with 
Mayer. 

On the last day of the conference Ponder agreed, with the help of R. T. 
Cox, to attempt the preparation of a clear and comprehensive statement 
about the various schemes which have been proposed for the kinetics of 
hemolysis. There has also been considerable correspondence between 
Ponder and Alberty, as Ponder conceives of Alberty’s theory as a special 
case of the more general theory which he and Cox expect soon to publish. 

In connection with the enzyme function Pillemer pointed out that earlier 
studies with Ecker indicated that C’3 is not used up in hemolysis* and that 
he believes for this reason that C’3 is the enzymic constituent of this sys- 
tem. Haurowitz considered it possible that an antibody-complement 
complex might have enzymic functions. 

6. Relation of C’ to Phenomena of Blood Coagulation.—Seegers reviewed 
the present status of the mechanism of blood coagulation and expressed 
the belief, to which Pillemer agreed, that prothrombin is not identical with 
C’l. However, accelerator globulin and C’ components may be related 
and this deserves further exploration. 

Maltaner reported on the effects of cephalin on blood coagulation and 
C’ activity,’ but without any clear evidence that these processes are directly 
related, 

General discussion followed of the role of Cat * and Mg** in blood coag- 
ulation and C’ activity. The two systems differ in that Ca++ appears to 
be the crucial constituent of the blood clotting system, whereas Mg** 
is the more important ion for the lytic activity of C’. 

7. Conglutination.— Rice reported some of her recent work. The conglu- 
tination reaction is often useful in diagnostic tests involving bird and animal 
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sera. It involves the agglutination of sheep erythrocytes by heated bo- 
vine serum and fresh horse serum and can be used as an indicator system 
in the complement fixation test since it shows analogies to hemolysis. 
The heated bovine serum contributes a substance called conglutinin while 
the fresh horse serum is thought to contribute C’ or some of its components. 
If a specific antigen-antibody reaction occurs it fixes the horse C’ and thus 
prevents conglutination. Several members of the group challenged the 
view that the action of the fresh horse serum is due to complement, so that 
further study of this complex subject was indicated. 


The conference adjourned with all members much stimulated by the 
contacts made and by the uninhibited discussions. These reflected ap- 
proaches to the subject from many entirely different points of view and 
usually centered less about what is known of C’ than about the defects and 
inconsistencies in the information presently available. Ways and means 
were discussed in detail for obtaining some of the most urgently needed 
data. As Haurowitz remarked, all of the participants will in future con- 
sider C’ from more critical and more objective viewpoints. 
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